Diligence report Infrastructure / DevTools Series B 2026-06-13

Code Metal

Verification-First AI Code Translation for Mission-Critical Systems

Code Metal has a differentiated verification-first product and credible defense and industrial demand signals, but at the disclosed $1.25B Series B price the public KPI record is still too thin to underwrite without more diligence.

Cover facts

Latest round 01

125 USD M [CO022]

Latest valuation 02

1250 USD M [CV001]

Total raised 03

177.95 USD M [CO025]

Founded 04

2023 [CO005]

Headquarters 05

Boston, MA [CO006]

Named customers 06

RTX, L3Harris, Toshiba, U.S. Air Force [CO028]

Company profile

Code Metal is a Boston-based private software company founded in 2023 that sells verifiable code translation and optimization for mission-critical systems. Its product narrative combines LLM-assisted translation with formal-methods-style proof generation and hardware-aware deployment, aimed at legacy and performance-sensitive software estates across defense and regulated industry. Public proof is strongest on financing momentum and a small set of named accounts, while the core diligence question is whether today's high-touch deployments can become durable software economics at scale.

Website: codemetal.ai
Founders: Peter Morales, Alex Showalter-Bucher
Founding location: Boston, MA
Headquarters: Boston, MA
Product: Code Metal sells AI-assisted, proof-backed code translation and optimization software for moving code across programming languages, toolchains, and hardware targets in mission-critical environments.
Customers: Defense programs and primes, plus industrial enterprises in semiconductor, automotive, telecom, and other regulated environments with legacy or hardware-specific code bases.
Business model: High-touch enterprise and government software motion with custom pricing, forward-deployed implementation support, and partnership-led expansion rather than self-serve SaaS packaging.
Stage: Series B
Funding status: Code Metal disclosed a $125M Series B at a $1.25B valuation in February 2026 after a $36.5M Series A at a $250M valuation in November 2025; disclosed cumulative financing totals about $177.95M.

[CO001, CO002, CO005, CO006, CO008, CO009, CO010, CO016]

Executive summary

Top strengths

Verification-first code translation for mission-critical and hardware-specific software is a clear wedge versus generic coding assistants.
Public evidence supports real customer relevance with named accounts including RTX, L3Harris, Toshiba, and the U.S. Air Force.
Financing momentum is exceptional, moving from seed through a $125M Series B in under two years.
Founder-market fit is strong, with technical leadership tied to MIT Lincoln Laboratory and defense software.

Top risks

The $1.25B valuation moved far ahead of public operating disclosure, leaving ARR, gross margin, and retention opaque.
Public customer proof is concentrated in a short, defense-heavy logo set with limited independent case studies or procurement records.
Hiring and role mix imply a high-touch delivery burden, so the services-to-platform transition is not yet clearly proven.
Mission-critical deployments face certification, security, and procurement friction that can slow revenue recognition and expansion.

Open gaps

ARR, gross margin, revenue mix, burn, and cash runway remain undisclosed.
Customer count, retention, contract duration, and concentration by account are still not public.
Series B terms beyond headline size and valuation, including preferences, secondary mix, and debt, are not disclosed.
Independent proof of contract values, programs-of-record status, and repeat deployment economics is still incomplete.

Chapter 01

01Company Overview

1.1 Identity, Product, and Operating Footprint

Code Metal presents itself as a software infrastructure company for environments where correctness matters more than speed alone. Its homepage says the mission is to make AI trustworthy and describes the offering as verifiable code translation for industries where every line of code matters, while the product page shows configuration around CPUs, GPUs, FPGAs, toolchains, and resource limits rather than a generic chat-style coding assistant. The official research pages reinforce that positioning by arguing that testing is insufficient for mission-critical software and that formal methods plus LLMs can produce code that is both generated and proved. Public category messaging shifted over time: the July 2024 seed announcement described AI-powered development workflows for the edge, while the later Series A and Series B materials more clearly framed the company around provably correct or verifiable code translation. The industry focus is also consistent across sources, centering defense, automotive, semiconductor, industrial, and robotics workloads. Headquarters evidence is strongest for Boston: the Series B announcement is datelined Boston, Geekwire calls the company Boston-based, and SEC search results show a Boston business address. At the same time, the careers page and investor materials imply a distributed footprint across Boston, San Francisco, remote roles, and possibly Washington, D.C., so the operating organization looks more distributed than a single-office label suggests.[CO001, CO002, CO003, CO004, CO005, CO006]

Snapshot KPI Table
Metric	Value / Status	Date	Confidence	Gap / Note
Founded	2023	2023	high	Public sources consistently support the founding year, but no exact incorporation day is visible in retained evidence.
Headquarters	Boston, Massachusetts	2026-02-19	high	Official announcement, SEC address, and independent coverage align on Boston as the clearest HQ anchor.
Current stage	Series B private company	2026-02-19	high	Latest disclosed round and valuation indicate a late-stage private posture.
One-line product	Verifiable code translation and optimization for mission-critical systems	2026-06-13	high	Framing is consistent across homepage, product, and research materials.
Latest round	$125M Series B led by Salesforce Ventures	2026-02-19	high	Official announcement and multiple follow-on reports corroborate the amount and lead investor.
Latest valuation	$1.25B	2026-02-19	high	Valuation is public, but the underlying operating metrics supporting it are not.
Total disclosed raised	~$177.95M	2026-02-19	medium	Computed as $3.45M pre-seed + $13M seed + $36.5M Series A + $125M Series B; not stated as a single company figure.
Named public customers	Toshiba, RTX, L3Harris, U.S. Air Force (4 named accounts)	2026-02-19	high	Named-customer evidence is qualitative; no customer-count disclosure is public.
Hiring footprint	Boston, San Francisco, remote; investor page also lists Washington, D.C.	2026-06-13	medium	Public pages support multi-location hiring, but not a formal office roster.
Open roles	At least 17 named openings	2026-06-13	medium	Visible count from the careers page, not a full recruiting pipeline export.
Revenue / ARR / exact headcount	null	2026-06-13	low	No public audited revenue, ARR, customer count, or current employee total appears in retained sources.

Mixes disclosed financing and customer signals with explicit null-style gaps where private company metrics are unsupported; the literal string null indicates not publicly disclosed in retained evidence.

[CO005, CO006, CO007, CO022, CO025, CO028]

FO002: Company Snapshot Logic

Shows how Code Metal links formal methods, runtime translation, strategic end markets, customers, capital, and concentration risks.

[CO001, CO002, CO004, CO016, CO025, CO028]

FO003: Snapshot KPIs

Condenses the supportable public scale indicators while excluding unsupported private-company economics.

Uses disclosed figures or direct page counts only; omitted economics remain intentionally absent because no retained public evidence supports them.

[CO022, CO025, CO028, CO033, CO037]

1.2 Founders, Leadership, and Governance Concentration

Leadership disclosure is real but still narrow. Peter Morales is consistently identified as founder and CEO, Alex Showalter-Bucher as co-founder, and Ryan Aytay as the senior operator added in 2026 as President and COO after leading Tableau. The founder-market-fit case is credible: launch and investor materials tie Morales and Showalter-Bucher to MIT Lincoln Laboratory, defense systems work, and F-35-related experience, which matches Code Metal’s go-to-market emphasis on regulated hardware and defense-adjacent environments. Technical depth also appears broader than the two founders alone. Company research, the LLMLift paper, and the Metalift ecosystem show the startup anchored in formal-methods and verified-lifting ideas, while UCSD professor Loris D’Antoni publicly identifies himself as a Scholar at Code Metal. Even so, governance transparency lags fundraising visibility. The retrieved About page shows values and the existence of a team page, but not a stable named leadership roster in text; public materials do not disclose a full board, ownership structure, or investor rights package. Ryan Aytay’s arrival reduces some founder concentration on the operating side, but the public narrative still leans heavily on Morales as chief technical and commercial spokesperson. That makes key-person dependence and governance opacity material diligence points rather than housekeeping issues.[CO008, CO009, CO010, CO011, CO012, CO013]

Leadership and Founder Table
Person	Role	Background	Founder-market fit / functional coverage	Key-person dependency
Peter Morales	Founder and CEO	Public sources tie Morales to Microsoft, MIT Lincoln Laboratory, and earlier defense/F-35 related software work.	Primary bridge between formal-methods narrative, defense credibility, fundraising, and customer storytelling.	High — Morales is the most visible public spokesperson across funding, product, and investor materials.
Alex Showalter-Bucher	Co-founder	Launch and investor sources place him with Morales in the MIT Lincoln Laboratory and defense-systems orbit.	Anchors co-founder technical credibility and early product conception in mission-critical software translation.	High — publicly named as co-founder, but less visible than Morales in later announcements.
Ryan Aytay	President and COO	Former Tableau CEO who joined in 2026 after a long Salesforce career.	Adds scaling, commercial, and enterprise-operating experience to a previously founder-heavy leadership story.	Medium — important operating hire, but public remit and governance authority are not yet fully disclosed.
Loris D'Antoni	Scholar at Code Metal	UCSD professor focused on software trust, formal methods, and specification-aligned language-model systems.	Shows access to external formal-methods bench beyond the founding pair.	Low to medium — supports technical depth, but public materials do not describe executive authority.

Partial roster only. The retrieved About page signals a team but does not expose a complete named management or board directory in text.

[CO008, CO009, CO010, CO011, CO012, CO013]

1.3 Funding History, Investor Map, and Strategic Overlap

The disclosed funding path is unusually fast for a company founded in 2023. In July 2024, Code Metal announced a $13 million seed led by Shield Capital and disclosed a prior $3.45 million pre-seed led by J2 Ventures. In November 2025, the company announced a $36.5 million Series A led by Accel at a $250 million valuation, while CNBC independently covered the same financing at roughly $36 million, suggesting press rounding rather than a different round. In February 2026, Code Metal announced a $125 million Series B led by Salesforce Ventures at a $1.25 billion valuation, with BusinessWire, Wired, and follow-on trade coverage broadly matching the headline. Summing the disclosed rounds yields about $177.95 million of publicly visible financing, although that total is arithmetic rather than a company-published lifetime-capital figure. The investor mix matters strategically. Defense-oriented backers such as Shield, J2, Overmatch, and RTX sit alongside enterprise-software investors such as Accel, B Capital, Salesforce Ventures, and Smith Point. That composition can help hiring, customer access, and procurement credibility, but it also introduces overlap risk in diligence. RTX is both a named investor and a named customer in public materials; Bosch Ventures is a strategic industrial investor, but public sources in hand do not show whether Bosch is also an operating customer. The cap table therefore appears strategically useful, but not cleanly independent from commercial proof points.[CO017, CO018, CO019, CO020, CO021, CO022]

Stakeholder or Investor Map
Stakeholder	Role	Control / economic importance	Diligence ask
Salesforce Ventures	Series B lead investor	Led the $125M Series B at the $1.25B valuation and likely has meaningful influence on later-stage financing expectations.	Confirm ownership, board or observer rights, and whether Salesforce ecosystem access is contractual or purely relational.
Accel	Series A lead; Series B participant	Anchors the first large-scale institutional round and appears again in the B, signaling continuity and reserve support.	Verify current ownership, follow-on behavior, and any governance rights carried from the Series A.
Shield Capital	Seed lead investor	Early defense-oriented backer that framed the company around national-security and mission-critical software relevance.	Clarify pro rata rights, defense-network support, and whether Shield still shapes go-to-market introductions.
J2 Ventures	Pre-seed lead; later participant	Earliest disclosed institutional sponsor, important for cap-table history and defense-ecosystem connectivity.	Request original terms, ownership dilution path, and any special information or protective rights.
RTX / RTX Ventures	Strategic investor and named customer overlap	Most visible overlap between financing and commercial proof because RTX is both on the cap table and in the public customer list.	Test how much customer proof is independent of investor influence and whether RTX has procurement, pilot, or reference constraints.
Bosch Ventures	Strategic Series A investor	Brings industrial adjacency and could support automotive or embedded-system credibility, but operating overlap is not publicly described.	Ask whether Bosch is a customer, design partner, channel, or purely financial investor.
Overmatch VC	Defense-focused investor	Signals alignment with defense and deep-tech procurement circles rather than only generalist software capital.	Clarify whether Overmatch provides customer access, recruiting support, or government-program introductions beyond capital.
Smith Point Capital	Enterprise software investor	Adds enterprise-operator brand signal to a syndicate otherwise heavy in defense and technical-specialist capital.	Determine whether support extends to enterprise go-to-market, hiring, and later-stage financing strategy.

Maps only publicly named stakeholders from disclosed financing materials and investor sites; ownership percentages, liquidation preferences, debt, and secondary components remain undisclosed.

[CO018, CO019, CO022, CO023, CO024, CO026]

1.4 Public Scale Signals, Milestones, and Evidence Gaps

Public scale evidence is directionally positive but still incomplete. The strongest externally visible customer proof is qualitative rather than quantitative: by February 2026, Code Metal and its BusinessWire release named Toshiba, RTX, L3Harris, and the U.S. Air Force as customers using the product to move between languages and optimize for hardware. At Series A, the company also claimed it was already on contract to deliver eight figures in revenue that year, but there is no public audited revenue, ARR, customer-count, or margin disclosure in the retained sources. Headcount is similarly opaque. A July 2024 launch story said the company employed seven people at that stage, and the current careers page lists at least 17 named openings across engineering, research, operations, finance, and solutions, but there is no verified current employee total. The milestone record is still strong enough to sketch a chronology: a 2023 founding, December 2023 Form D presence, July 2024 seed disclosure, a March 2025 USSOCOM-linked hackathon, November 2025 Series A coverage, and February-March 2026 Series B announcement plus Form D filing. The main caution is analytical rather than legal. Wired explicitly noted that methodologies in AI code tooling remain unproven and that investors are making category bets, while Code Metal’s own valuation and growth narrative depends heavily on company announcements, investor theses, and picked-up press. The company also maintains undated landing pages for Forbes and Wired coverage, but the retrieved text does not expose publication metadata, so those are weaker milestone anchors than dated filings and financing stories.[CO021, CO028, CO029, CO030, CO031, CO032]

Milestone Table
Date	Event	Type	Amount / valuation / status	Participants	Implication
2023	Company founded	founding	Founding year public; exact day not disclosed	Peter Morales; Alex Showalter-Bucher	Establishes Code Metal as a very young company relative to its 2026 valuation.
2023-12-20	First visible Form D filing appears in SEC search results	financing	Form D present	Code Metal	Shows external filing evidence that capital formation began during the founding window.
2024-07-23	Seed announcement discloses prior pre-seed	financing	$13M seed plus prior $3.45M pre-seed	Shield Capital; J2 Ventures; Fulcrum Venture Group	Creates the first clear public financing baseline and confirms two early rounds rather than one.
2024-08-01	SEC search shows 2024 Form D entry	financing	Form D present	Code Metal	Adds filing confirmation after the July 2024 funding announcement.
2025-03-14 to 2025-03-16	Metal Ops hackathon runs for USSOCOM-focused smart-city concepts	partnership	$10K prize event; defense-facing ecosystem signal	Code Metal; USSOCOM-linked participants	Shows brand-building and relationship development inside the defense ecosystem before the later venture step-up.
2025-11-12	CNBC covers the Accel-led Series A	financing	~$36M media-reported financing	Code Metal; Accel; CNBC	Provides independent timing for the company's move into larger institutional funding.
2025-11-13	SEC search shows 2025 Form D entry	financing	Form D present	Code Metal	Supports the existence of a late-2025 financing event immediately after press coverage.
2026-02-19	Series B announced and Ryan Aytay joins	financing	$125M at $1.25B valuation	Salesforce Ventures; Accel; B Capital; Smith Point; J2; Shield; Overmatch; RTX; Ryan Aytay	Marks the step-up from growth-stage startup to unicorn valuation with an added operating executive.
2026-03-12	SEC search shows 2026 Form D entry	financing	Form D present	Code Metal	Adds filing confirmation after the Series B announcement.
2026-06-13	Careers page still shows broad multi-function hiring	scale	At least 17 named openings visible	Code Metal	Suggests the company remains in build-out mode after the Series B rather than shifting into hiring freeze or consolidation.

This is the chronology of record for retained public evidence. Funding rows mix announcement dates and later SEC filing dates because both are material and publicly visible.

[CO005, CO011, CO017, CO021, CO022, CO033]

FO001: Company Milestone Timeline

Tracks Code Metal from 2023 founding through early financing, defense-ecosystem signaling, and the 2026 unicorn round.

Where only a year is public, the timeline preserves the coarser date instead of inventing a day.

[CO005, CO011, CO017, CO020, CO021, CO022]

Chapter 02

02Market Analysis

2.1 Market boundary: high-assurance modernization, not generic copilots

Code Metal should be placed in the market for AI-driven code modernization, translation, and verification for mission-critical software, not in the broad market for generic developer copilots. Its product surface asks users to specify CPUs, GPUs, FPGAs, toolchains, and runtime resource limits, while its research pages argue that testing alone is insufficient when software controls avionics, semiconductor tooling, autonomous systems, embedded devices, or regulated infrastructure. The company and investor narrative is consistent on the wedge: decades-old code written in C, C++, MATLAB, ADA, CUDA, and other legacy or hardware-coupled forms must be moved onto newer architectures without giving up correctness. That boundary is narrower than AI coding and even narrower than general application modernization. Included spend is the software and engineering budget used to translate legacy code, prove or verify behavior, integrate with constrained runtimes, and generate evidence acceptable to security, quality, and certification stakeholders. It includes defense embedded systems, aerospace and industrial edge controllers, semiconductor and accelerator software, and other high-stakes environments where rewrite failure is expensive. Excluded spend includes greenfield software development, casual code-completion tools, and generic chat-style copilots that improve coding velocity but do not solve hardware-portability or assurance bottlenecks. Status-quo substitutes are manual rewrites, specialist kernel/compiler teams, internal platform engineering, static-analysis and testing tools, and incumbent assurance vendors.[CM001, CM002, CM003, CM004, CM005, CM006]

Market definition table
Segment / category	Included spend	Excluded spend	Buyer / payer	Relevance
High-assurance code translation	Legacy-to-modern language migration, hardware retargeting, proof or verification workflows	Generic greenfield coding assistants	Defense primes, platform engineering groups, program-funded modernization teams	Core Code Metal wedge
Embedded and edge software modernization	Runtime-constrained code for CPUs, GPUs, FPGAs, avionics, robotics, industrial controllers	Generic enterprise app refactoring with no hardware constraint	OEM software teams, embedded platform leads, systems integrators	Important because product messaging is runtime- and hardware-aware
Semiconductor and accelerator portability	CUDA or hardware-coupled kernels, compiler and toolchain integration, accelerator migration	Cloud-only AI app scripting	Architecture teams, compiler/platform leaders, performance engineering budgets	Explicitly supported by Code Metal portability materials
Mission-critical assurance and certification support	Evidence generation, secure delivery, memory-safety and trust controls around modernized code	Pure QA outsourcing or undifferentiated bug triage	Engineering leaders, security/assurance functions, program offices	Where formal verification and trust requirements matter most
Status-quo substitutes	Manual rewrites, specialist kernel teams, internal tooling, static analysis, testing, assurance vendors	Purpose-built end-to-end modernization platform	Same budgets as above	What a Code Metal-like vendor must displace to win spend

The table intentionally narrows the market to modernization and assurance spend for legacy or hardware-coupled software, not to total AI coding or all application-modernization spend.

[CM001, CM003, CM004, CM005, CM007, CM008]

2.2 Sizing with evidence-constrained lenses rather than one headline TAM

Public evidence supports sizeable adjacent budgets but not one clean published TAM for verified AI code modernization. The broadest commercial lens is legacy application modernization: Grand View Research estimates a USD 17.8 billion global market in 2023 growing to USD 52.46 billion by 2030, driven partly by the high maintenance cost and operational drag of legacy applications. A second adjacent lens is software assurance and code security: Mordor Intelligence estimates the application security market at USD 14.83 billion in 2026 and USD 28.11 billion by 2031, with scanning increasingly embedded into continuous integration pipelines. Those figures are too broad for Code Metal’s wedge, but they show that modernization and verification already command real budget categories. A third lens is mission-critical modernization inside defense and regulated engineering organizations. Official DoD documents do not publish a clean line item for Code Metal’s exact category, yet they do show software modernization and rapid secure delivery as strategic priorities. Because the public record lacks standalone pricing, contract volumes, or procurement-conversion data for verifiable code-translation platforms, the most defensible estimate is a wedge analysis rather than false precision: roughly USD 0.6-1.8 billion of near-term annual spend appears plausible for the serviceable market spanning defense primes, government software programs, semiconductor platform teams, and industrial or aerospace OEM software groups; a broader USD 1.8-4.5 billion opportunity becomes plausible only if buyers fund both migration and ongoing verification across many architectures and programs. The uncertainty is material and should be preserved, not averaged away.[CM011, CM012, CM013, CM014, CM015, CM016]

TAM / SAM / sizing-lens table
Publisher / lens	Year	Geography	Value	CAGR / volume	Methodology	Confidence	Limitation
Grand View application modernization services	2023-2030	Global	USD 17.8B in 2023; USD 52.46B by 2030	16.7% CAGR	Broad legacy-application modernization-services market	medium	Far broader than mission-critical code translation
Mordor application security market	2026-2031	Global	USD 14.83B in 2026; USD 28.11B by 2031	13.64% CAGR	Broad assurance and application-security tooling market	medium	Includes many security products irrelevant to code migration
DoD software modernization priority lens	FY25-26	US defense	No clean standalone dollar wedge isolated in retained public sources	Strategic priority, secure iterative delivery	Official modernization plans and software pathway rules	medium	Priority is clear, direct category budget is opaque
Report estimate: near-term serviceable market	2026	US plus allied mission-critical buyers	USD 0.6B-1.8B	n/a	Wedge across defense, semiconductor, aerospace, industrial-edge modernization and assurance budgets	low	Derived estimate; public pricing and conversion data are sparse
Report estimate: broader multi-sector TAM	2026	Global regulated and mission-critical buyers	USD 1.8B-4.5B	n/a	Assumes recurring spend on both migration and verification across many architectures	low	Depends on adoption beyond pilots and beyond defense

Observed analyst-market totals are used only as adjacent budget pools; the two report estimates are evidence-constrained wedges, not published market statistics.

[CM011, CM012, CM013, CM014, CM015, CM016]

FM001: Market sizing lens

Broad modernization and assurance budgets narrow sharply when the lens is restricted to mission-critical portability plus formal-assurance use cases.

All values are shown in USD millions. The upper two layers are adjacent public market pools rather than perfectly nested categories; lower layers are report estimates used to bound the likely wedge.

[CM011, CM013, CM019, CM020, CM021, CM046]

FM002: Market estimate range

The narrow market is defensible only as a range because public pricing, procurement conversion, and verification-throughput data are incomplete.

Each row uses USD millions. The first two are evidence-constrained report estimates, not published analyst values, and the SOM row is illustrative rather than company-guided.

[CM019, CM020, CM021, CM046]

2.3 Buyer, user, and payer segmentation

The daily user is likely an engineering team rather than a casual developer seat: embedded-software engineers, platform and compiler teams, GPU or accelerator specialists, verification engineers, and mission-software developers working inside large codebases that are coupled to specific hardware or certification regimes. Defense primes are the clearest early segment because they already manage software-intensive programs under delivery and acceptance milestones, while government program offices and sponsors help determine whether a pilot can become a funded program. In semiconductors and platform software, the buyer is more likely a VP engineering, platform lead, or architecture team that needs to move workloads across heterogeneous hardware while preserving performance and correctness. In industrial, aerospace, and robotics settings, the budget tends to sit with product-software, embedded-platform, or modernization leaders who own long-lived edge software. The payer therefore changes by segment, but it is usually not an individual developer. It is an engineering, program, or modernization budget that justifies spending through avoided rewrite labor, faster hardware migration, lower certification friction, or fewer defects in deployment. Systems integrators and defense IT contractors also matter because they sit between government demand and the actual engineering work; Booz Allen and SAIC both market AI and digital modernization to national-security customers, validating that intermediary layer. Adoption is likely to start with a bounded pilot or minimum viable capability release, then expand only after integration into toolchains, build systems, verification workflows, and operational-acceptance processes. That dynamic favors vendors who can support enterprise onboarding, not just self-serve product usage.[CM022, CM023, CM024, CM025, CM026, CM027]

Segment / buyer map
Segment	Buyer	User	Payer	Budget owner	Adoption trigger
Defense prime engineering team	Program engineering leader	Embedded, mission, toolchain, and verification engineers	Prime or contract-funded program budget	VP engineering, program manager, chief engineer	Legacy program code must move faster without breaking acceptance
Government program office	Program executive office or sponsor	Government technical staff plus integrator teams	Appropriated software or modernization budget	Program manager / sponsor	Need secure iterative releases and acceptable evidence for deployment
Semiconductor / platform team	Platform architecture or compiler lead	Kernel, compiler, accelerator, and performance engineers	Platform R&D budget	VP engineering or architecture lead	Need to retarget code across heterogeneous hardware without manual rewrites
Industrial / aerospace OEM software team	Product software or embedded-platform leader	Edge, controls, avionics, robotics, or plant software engineers	Product engineering or modernization budget	CTO, VP software, or systems lead	Long-lived edge software must adapt to new hardware and trust requirements
Systems integrator / mission IT contractor	Account or delivery leader	Delivery engineers and modernization teams	Government contract or task-order budget	Mission IT / delivery executive	Acts as intermediary when government demand and engineering execution are split

Rows describe likely commercial motions, not disclosed Code Metal contracts. Budget ownership is inferred from official procurement pathways, product constraints, and how mission-software work is typically staffed.

[CM022, CM023, CM024, CM025, CM026, CM027]

FM003: Buyer / segment map

Early demand is strongest where hardware portability pain, assurance burden, and centralized budget authority overlap.

The matrix is qualitative and synthesized from retained sources; it is intended to rank segment fit, not to quantify win rates.

[CM022, CM023, CM024, CM025, CM026, CM027]

FM004: Adoption funnel or value-chain map

Winning the market requires moving from code pain to deployment approval; procurement and assurance steps are part of the product challenge.

This is an adoption flow rather than a numeric funnel. The gating steps are based on public defense-software pathway documents and category evidence on assurance needs.

[CM023, CM027, CM028, CM037, CM038, CM048]

2.4 Growth drivers, certification friction, and market limits

Several forces push this market forward. Hardware churn is real: Code Metal’s own portability writing is explicitly about what it takes to move CUDA workloads off NVIDIA or move legacy code onto newer accelerators, and investor materials frame infrastructure debt in defense, semiconductor, and aerospace systems as a compounding problem. At the same time, security and trust requirements are tightening. CISA and NSA recommend memory-safe approaches for national security systems and critical infrastructure, while NIST is extending trustworthy-AI guidance toward critical infrastructure operations. Those pressures increase the value of tooling that can modernize code without sacrificing evidence of correctness. The constraints are equally important. DARPA’s ARCOS program states directly that current DoD certification practices are antiquated and unable to scale because they rely on human evaluators and poorly decomposed evidence. DoDI 5000.87 and the broader software-modernization plan encourage iterative delivery, but they also confirm that defense adoption runs through programmatic gates, operational acceptance, and ongoing oversight. Technical scalability is not proven either: Code Metal’s own translation papers show domain-specific gains, but broader research such as UniPar still reports modest compilation and correctness rates for complex parallel-code translation, and empirical work on Copilot-generated code finds material security weaknesses. The result is a market with genuine demand but unresolved proof burdens: integration into messy legacy repositories may stay services-heavy, and the commercial upside depends on proving that verification economics scale beyond bespoke pilots.[CM032, CM033, CM034, CM035, CM036, CM037]

Growth drivers and constraints table
Driver / constraint	Direction	Timing	Implication	Diligence ask
Legacy code debt in hardware-coupled systems	Positive	Current	Creates rewrite-avoidance demand and modernization urgency	How much of deployment is reusable product versus bespoke services?
Hardware churn and portability away from single-vendor stacks	Positive	Current to medium term	Raises value of verified retargeting across CPUs, GPUs, FPGAs, and accelerators	Which architectures are already proven in production?
Memory-safety and trustworthy-AI pressure	Positive	Current to medium term	Pushes buyers toward higher-assurance modernization paths	Can the platform emit evidence auditors and security reviewers accept?
DoD certification and procurement friction	Negative	Current	Programmatic gates can delay conversion from pilot to scaled contract	What is average time from evaluation to funded program of record?
Verification scalability and integration risk	Negative	Current	Large codebases and bespoke toolchains may keep onboarding services-heavy	What third-party benchmarks exist for repo scale, proof throughput, and labor savings?

The table mixes market drivers and adoption brakes because both shape serviceable demand. Timing is qualitative because public conversion-cycle data are not disclosed.

[CM032, CM033, CM036, CM037, CM038, CM039]

2.5 Exhibits

Chapter 03

03Competitors

3.1 Landscape and Nearest Alternatives

The buyer does not evaluate Code Metal against one neat peer set. The landscape breaks into four practical categories. First are high-assurance or formal-methods-oriented peers such as GrammaTech and Galois, which sell software-assurance, security, and high-assurance solutions to defense and other regulated environments. These are the closest substitutes on trust and verification posture, and both have longer public track records than Code Metal. GrammaTech's public learn hub is likewise organized around cybersecurity and software assurance rather than around code-translation workflow. Second are adjacent AI code-quality tools such as Diffblue, Snyk, and Sonar. They compete for part of the same engineering budget because they promise faster test generation, static analysis, or AI-era code verification, but their public positioning is narrower than Code Metal's modernization-plus-portability narrative. Third are modernization incumbents such as IBM, which can wrap coding assistance into a broader governed AI and application-modernization stack. Fourth are defense-services substitutes such as Booz Allen, SAIC, and internal engineering teams, which can pursue modernization through services engagements or custom builds instead of buying a new product vendor. Across those groups, most alternatives do not advertise the exact combination of code translation, target-hardware portability, and formal verification that Code Metal emphasizes. The adverse read is that the market may still reward incumbency, contract access, and installed base more than feature novelty.[CP001, CP002, CP003, CP004, CP007, CP009]

Competitor Profile Table
Vendor	Category	Scale / Track Record Signal	Primary Buyer or Segment	Public Differentiation	Limitation versus Code Metal
Code Metal	AI code translation / high assurance	2024-founded startup; investor and media materials emphasize defense growth	Mission software, modernization, edge / hardware-portability programs	Translation plus heterogeneous target portability plus formal-verification-centric messaging	Young installed base; procurement proof and pricing transparency remain limited
GrammaTech	Software assurance / cyber-security	30+ years of cyber innovation	Defense, software assurance, security engineering	Deep assurance and vulnerability-analysis credibility	Public positioning is broader assurance tooling, not Code Metal-style AI transpilation
Galois	High-assurance R&D / tools	Established high-assurance specialist across defense and semiconductors	Aerospace, defense, semiconductors, regulated engineering	High-assurance reputation and cross-sector technical credibility	Public site does not foreground AI translation portability as the core offer
Diffblue	AI testing agent	Enterprise unit-testing specialist	Engineering teams seeking automated unit tests	Automated enterprise unit-test generation	Narrower than full modernization, portability, or proof-oriented translation
Snyk	AI security / SAST	Large dev-security platform	AppSec, DevSecOps, platform engineering	Developer-first code security and auto-fix workflow	Competes for adjacent quality budget, not end-to-end code migration
Sonar	Code verification / quality	7M+ developers; 75% of Fortune 100 claimed	Developer platform, quality, engineering leadership	Massive installed base and AI-era verification branding	Public messaging is broad code quality, not heterogeneous code translation
IBM watsonx + AI coding agent	Modernization incumbent	Global enterprise platform reach	CIO, enterprise architecture, app modernization	Governed AI plus coding-agent modernization narrative	Less public emphasis on formal verification or hardware-portability specificity
Booz Allen / SAIC	Defense-services substitutes	Federal AI and mission-IT incumbency	Program office, integrator-led modernization, services budgets	Existing services relationships and procurement familiarity	Not a like-for-like product answer to Code Metal's exact workflow
Internal build / open source	Status quo substitute	Available to large technical teams	Defense primes, semiconductor firms, advanced internal platforms	Avoids vendor lock-in and can be customized deeply	Requires buyers to carry integration, proof, and maintenance burden themselves

Rows combine official vendor positioning with inferred buyer mapping as of 2026-06-13; scale signals are directional because most private vendors do not publish full contract or revenue detail.

[CP001, CP004, CP007, CP009, CP011, CP013]

FP001: Competitive Positioning Map

Ordinal map of named alternatives on two evidence-backed axes: verification posture (x) and installed-base or procurement reach (y). Higher scores indicate stronger public emphasis or stronger publicly visible distribution.

Axis values are ordinal judgments derived from public positioning rather than benchmarked metrics; the purpose is to show relative competitive shape, not a quantified market score.

[CP007, CP009, CP011, CP013, CP014, CP015]

3.2 Capability Breadth and Verification Posture

Capability comparison is less about who also mentions AI and more about where each vendor sits in the workflow. Code Metal markets verifiable code translation into chosen runtime environments, including different CPU and accelerator targets, and supports that story with formal-verification-oriented research. GrammaTech and Galois are closest on assurance credibility, but their public materials lead with software analysis, cyber-security, high-assurance solutions, and software-assurance thought leadership rather than AI-led heterogeneous transpilation. Diffblue is centered on enterprise unit-test generation, with company materials emphasizing autonomous test writing and maintenance. Snyk Code and Sonar target secure code review, static analysis, and AI-era verification for broad developer populations, with SonarQube marketed as code verification for the AI era. IBM's AI coding agent and watsonx Code Assistant compete from the opposite direction: not via deep proof-first messaging, but via enterprise modernization breadth, governed AI, and integration with existing infrastructure. That means Code Metal's exact feature mix appears differentiated, yet it also means the company must prove that buyers care about translation correctness and hardware portability enough to choose a more specialized product instead of accepting a broader but less assurance-specific incumbent or a narrower adjacent tool. For diligence, the most important competitive question is whether verification is a budget-determining buying criterion or only a tie-breaker once distribution and procurement are already decided.[CP003, CP007, CP008, CP009, CP011, CP012]

Feature / Capability Matrix
Buying Criterion	Code Metal	GrammaTech	Galois	Diffblue	Snyk	Sonar	IBM
AI-led code translation	Yes	Unknown	Unknown	No	No	No	Partial
Heterogeneous hardware target portability	Yes	Unknown	Unknown	No	No	No	Partial
Formal-verification-centric messaging	Yes	Partial	Partial	No	No	No	No
Software assurance / vulnerability analysis	Partial	Yes	Yes	Partial	Yes	Yes	Partial
Automated unit-test generation	Unknown	Unknown	Unknown	Yes	No	No	Unknown
Broad enterprise governed AI stack	No	No	No	No	No	No	Yes
Installed-base advantage visible in public copy	No	No	No	No	Partial	Yes	Yes
Defense-services-led substitute path	No	Partial	Partial	No	No	No	Partial

Yes/Partial/No/Unknown cells reflect only what could be substantiated from cited public pages; Unknown means the capability was not confirmable from the retained evidence, not that it is absent.

[CP003, CP007, CP008, CP009, CP011, CP012]

FP002: Feature Breadth / Capability Map

Publicly documentable capability coverage across key buyers' criteria relevant to mission-critical modernization accounts.

Cells mark only what was supportable from cited public materials. Unknown is used deliberately where public evidence did not substantiate a capability one way or the other.

[CP003, CP007, CP009, CP011, CP012, CP013]

3.3 Deployment, Pricing, and Budget Owner

The practical buyer map is fragmented. Code Metal is most likely to sell into engineering, modernization, or mission-software program budgets that care about moving legacy code onto new runtimes without sacrificing correctness. Adjacent tools point at different owners: Snyk toward AppSec and DevSecOps, Sonar toward broad developer-platform or quality leadership, and Diffblue toward QA or developer productivity. IBM can approach the same account through enterprise architecture, application-modernization, or AI platform budgets, while Booz Allen and SAIC can pursue services-led program dollars. This matters because public pricing is sparse across the category. Most public pages describe capability, governance, or services posture but do not publish realized contract values, which limits precision in list-price comparisons and makes procurement-channel analysis more important than nominal price. The adverse implication is that larger incumbents can compress competition by bundling adjacent functionality into broader modernization stacks, and defense-services substitutes can enter through pre-existing relationships rather than through a greenfield software procurement. Code Metal therefore needs not just superior functionality, but a clean story for why its verification-oriented workflow deserves its own budget line instead of being absorbed into a larger platform or services award.[CP015, CP016, CP017, CP018, CP022, CP027]

Deployment / Pricing / Budget Owner Comparison
Vendor or Class	Deployment / Contract Model	Public Pricing Visibility	Likely Budget Owner	Procurement Channel Strength	Implication for Code Metal
Code Metal	Project-led product deployment around modernization workflows	Sparse; no complete public contract schedule	Engineering / modernization / mission software	Emerging product procurement	Must justify a distinct specialized tool budget
GrammaTech	Tooling plus assurance services	Sparse	Security engineering / assurance leadership	Established defense and assurance relationships	Can win on trust even without Code Metal's portability story
Galois	High-assurance solutions and tools	Sparse	Advanced engineering / R&D / regulated programs	Established technical credibility	Can appeal where formal-methods reputation outweighs startup novelty
Diffblue	Enterprise testing software	Sparse	QA / developer productivity	Typical enterprise software motion	Adjacency mostly in test automation budgets
Snyk / Sonar	Platform or seat-led developer tooling	Partial to sparse	AppSec / DevSecOps / platform engineering	Strong developer-platform reach	Can absorb adjacent spend before migration budgets form
IBM	Broader modernization and governed-AI platform	Sparse	CIO / enterprise architecture / modernization	Very strong incumbent enterprise channel	Can bundle adjacent functionality inside existing relationships
Booz Allen / SAIC	Services-led mission IT or AI engagements	Custom / not publicly standardized here	Program office / integrator services	Strong federal relationship path	Can substitute procurement convenience for feature match
Internal build	Labor and infrastructure spend	N/A	Internal engineering leadership	No vendor procurement required	Appeals to sophisticated teams seeking control over IP and workflows

Public pricing is directionally sparse across the category, so the table emphasizes contract model and likely budget owner more than list-price precision.

[CP015, CP016, CP017, CP018, CP022, CP027]

3.4 Switching Costs, Substitutes, and Moat Durability

Code Metal's moat is promising but not obviously permanent. If a customer adopts it deeply, the switching costs should come from translated code assets, target-hardware configuration knowledge, and verification workflows tied to specific modernization programs, rather than from classic seat-license lock-in. Those costs are meaningful, but they may begin at the project level instead of at company-wide platform scale, which gives incumbents time to respond. Internal build is also a real substitute for sophisticated defense or semiconductor teams: public academic tooling such as Metalift and general LLM advances make it possible for well-resourced engineering organizations to prototype their own translation-and-proof pipelines, albeit with heavier integration and maintenance burden. The biggest adverse evidence is structural rather than feature-based. GrammaTech and Galois have longer formal-methods reputations, IBM has broader modernization and governed-AI reach, Sonar has a far larger developer footprint, and Booz Allen or SAIC can win when procurement convenience outweighs product differentiation. Public evidence on realized pricing, specific contract vehicles, and direct benchmarked win rates remains thin, so the moat case today rests more on logical fit than on abundant public proof of displacement at scale.[CP020, CP025, CP026, CP030, CP031, CP034]

Moat Durability / Competitive Risk Register
Moat or Risk Area	Threat	Severity	Why It Matters	Mitigation / Diligence Ask
Exact feature mix	Incumbents still win despite weaker feature match	High	Installed base and procurement access may matter more than translation-plus-proof specificity	Request named win/loss evidence against IBM and services primes
Verification narrative	Peers with longer assurance track records out-credential Code Metal	Medium-High	Formal-methods reputation can influence regulated procurement	Gather customer references that cite verification as the purchase reason
Project-based switching costs	Early deployments may not yet create enterprise-wide lock-in	Medium	Incumbents can respond before Code Metal standardizes across the account	Measure expansion from first migration project to broader platform use
Internal build substitute	Sophisticated teams assemble custom tooling using open source and LLMs	Medium	Buyers may prefer control over workflow and IP	Quantify time-to-value versus internal build baselines
Pricing transparency	Sparse public pricing obscures true comparative TCO	Medium	Hard to prove savings or justify premiums from public evidence alone	Request realized pricing examples or procurement artifacts in diligence
Defense procurement channels	Services incumbents may enter through existing relationships or convenience	High	Contract access can beat product differentiation in defense accounts	Map which current customers required cleared services or partner channels

Severity assessments are diligence-stage judgments derived from public positioning and procurement context rather than from disclosed win-rate data.

[CP020, CP024, CP025, CP026, CP030, CP031]

FP003: Moat / Readiness KPIs

Compact summary of the competitive durability signals that matter most for Code Metal in public evidence.

[CP019, CP020, CP030, CP031, CP032, CP037]

3.5 Exhibits

Chapter 04

04Financials

4.1 Revenue model and pricing posture: contract visibility is real, but realized economics are not

Code Metal's public materials support a revenue model built around enterprise and government code-translation engagements rather than self-serve developer subscriptions. The homepage and product pages position the product as verifiable code translation for mission-critical industries, while the product surface emphasizes hardware targets, toolchains, and deployment constraints rather than published seat pricing. The strongest traction statement came in the November 2025 Series A announcement, where the company said it was already on contract to deliver eight figures in revenue that year across defense, automotive, and semiconductor deployments. That is a meaningful demand signal, but it is still a contract-delivery claim rather than an audited revenue disclosure, and the company does not disclose how much of that work is recurring platform revenue versus project-based translation or integration work. Pricing posture is correspondingly opaque. Neither the homepage nor product pages expose list prices, packages, or realized contract benchmarks, and no retained source discloses discounts, minimums, or usage-based terms. Public evidence instead points to negotiated enterprise and government contracting, likely with upfront scoping around repository segmentation, target hardware, verification requirements, and deployment support. The research pages and investor theses also show why buyers might pay for this: Code Metal is selling both portability across heterogeneous hardware and proof-oriented verification in environments where software failure can trigger recalls, certification problems, or mission failure. That makes the value proposition legible even though the pricing model remains undisclosed. The main revenue-quality risk is mix. Public evidence is strong that customers pay for high-stakes code migration and verification, but weak on whether those dollars already behave like repeatable software revenue. Hiring for forward-deployed and solutions roles suggests some contracts still need meaningful customer-specific labor. If Code Metal can standardize those workflows into reusable translation and verification modules, margins could improve materially; if not, the business may remain closer to project-led technical services wrapped around proprietary tooling.[CI001, CI002, CI003, CI004, CI005, CI006]

Revenue streams table
stream	mechanism	unit	current value/status	quality	diligence ask
Enterprise code translation and modernization contracts	Negotiated project or platform contract for migrating or optimizing critical codebases	Contract / program	Public contract existence is supported by named customers and the eight-figure Series A statement, but contract values are undisclosed	Medium — traction is visible, realized economics are not	Provide revenue by customer, contract type, and what portion is recurring versus one-time delivery
Government and defense program work	Program-of-record or mission software translation and verification work	Program / task order	U.S. Air Force and L3Harris are publicly named, but no public award dollars or backlog values are retained	Medium — strong logos, weak dollar transparency	Disclose government backlog, funded value, and whether revenue sits on prime or subcontract paper
Hardware portability and optimization engagements	Translation across CPUs, GPUs, FPGAs, NPUs, and toolchains for target hardware	Migration project / workload	Product and research sources show portability across heterogeneous hardware as a core use case	Medium — use case is clear, monetization package is not	Disclose whether portability work is billed per project, per target, or as a reusable platform subscription
Verification and compliance layer	Formal-proof, validation, and production-readiness checks layered onto translated code	Bundled module or premium scope	Verification is central to the value proposition, but no public source breaks out separate pricing	Low to medium — differentiation is clear, monetization is opaque	Show attach rate and pricing uplift for verification-heavy programs relative to translation-only work
Forward-deployed implementation support	Customer-specific integration, deployment, and solution design support	Milestone / labor effort	Careers evidence implies hands-on implementation remains important in delivery	Low — inferred from hiring, not directly disclosed as a revenue line	Break out professional-services revenue, billable utilization, and gross margin by deployment phase
Research-derived workflow and model reuse	Reusable translation workflows, proof techniques, and domain models applied across programs	Software leverage / IP reuse	Research activity is visible, but no public source quantifies revenue from reusable product components	Low — leverage thesis is plausible but unproven	Provide cohort evidence that later deployments require less custom work and produce higher gross margin

This table separates visible demand mechanisms from hidden economics. Public evidence supports contract-backed revenue opportunities, but not contract values, revenue recognition, or the platform-versus-services split.

[CI001, CI002, CI004, CI005, CI006, CI010]

Pricing / monetization table
offering or posture	price / unit / contract	list vs realized pricing	public evidence	implication	diligence ask
Core enterprise platform engagement	Negotiated contract; no public unit disclosed	No public list price and no realized benchmark disclosed	Homepage and product pages route buyers to direct engagement rather than a rate card	Pricing power may be strong, but public comparability is zero	Provide ACV range, pricing basis, contract duration, and minimum deal size
Government / defense program work	Negotiated program or contract vehicle	No public rate or award-value detail retained	Programs-of-record language is public, but no pricing mechanics are disclosed	Government revenue may be strategically valuable but hard to underwrite	Disclose prime/sub status, funded award value, and renewal / option structure
Hardware portability projects	Likely project- or scope-based pricing tied to target architecture and repository complexity	Realized pricing unknown	Product and research pages describe multi-architecture translation work, not standardized packaging	Economic outcome may vary sharply by customer complexity	Show pricing drivers such as target hardware count, codebase size, and verification scope
Verification-intensive delivery	Potential premium for proof, validation, and compliance needs	No public surcharge or module pricing disclosed	Official and investor sources make verification the differentiator, but no separate monetization is public	If verification commands premium pricing, gross margin may improve once workflows standardize	Disclose attach rate and uplift for proof-heavy contracts
Partner-assisted enterprise access	Indirect help from investor and partner ecosystems rather than public reseller pricing	No referral, channel, or rev-share economics disclosed	Series B use-of-funds and investor composition highlight partnerships, but terms are absent	Partner access may reduce sales friction without yet creating transparent channel economics	Disclose sourced-pipeline share and economics of investor- or partner-originated deals

Every row in this table is constrained by the same core fact: Code Metal has no public pricing page and no retained source with realized contract values. The reader should treat all pricing posture analysis as directional rather than quantified.

[CI017, CI018, CI022, CI028, CI030, CI037]

FI001: Revenue model bridge

Public evidence suggests Code Metal turns repository-specific migration and verification work into enterprise or government contracts, but the public record stops before realized pricing or recurring-software mix is visible.

This is a qualitative bridge, not a quantified waterfall. The retained public record is strong on what buyers purchase and weak on what they pay or how revenue is recognized.

[CI001, CI002, CI004, CI005, CI010, CI017]

4.2 Traction and GTM proxies: visible logos, strategic investors, and high-touch sales indicators

Public traction is more credible than a typical pre-metrics AI infrastructure company, but still incomplete. By February 2026 the company and Business Wire named Toshiba, RTX, L3Harris, and the U.S. Air Force as customers, and Salesforce Ventures said demand had already pulled Code Metal into programs of record across the Air Force and L3Harris. That is stronger than generic pilot language and suggests Code Metal is already selling into both enterprise or industrial accounts and government or defense programs. The tradeoff is that the public customer list is still short, which means concentration risk cannot be dismissed from public evidence alone. Go-to-market signals point to a long-cycle, partner-assisted enterprise motion rather than efficient bottom-up adoption. The careers page lists a Facility Security Officer, Forward Deployed Engineer, Principal Solutions Architect, and senior platform roles, all of which fit a motion involving procurement review, security handling, and bespoke deployment on customer hardware. The official Series B use-of-funds statement also prioritized expanding commercial and government partnerships, while the investor syndicate combines enterprise-software backers such as Salesforce Ventures, Accel, and B Capital with defense- and strategic-oriented names such as RTX, Shield, J2, and Overmatch. That mix likely helps introductions, procurement credibility, and channel access, even if the exact partner economics are undisclosed. The sales-efficiency question remains open because no CAC, payback, win-rate, or cycle-time metric is public. Still, the combination of programs-of-record language, defense-oriented hiring, and strategic investor overlap makes it reasonable to infer that sales cycles are long and expensive, and that partner relationships may be part of the efficiency story. If that is correct, then the next step in diligence is not to guess SaaS-style efficiency ratios but to determine how much revenue each high-touch deployment can unlock, retain, and expand once the initial migration work is complete.[CI008, CI009, CI010, CI012, CI013, CI014]

Traction and GTM proxies table
signal	public evidence	financial implication	confidence	diligence ask
Eight-figure contract-delivery claim	Series A announcement said Code Metal was already on contract to deliver eight figures in revenue that year	Supports real demand and non-trivial contract size, but says nothing about recurrence or realized collections	medium	Provide booked, billed, and collected revenue by quarter for 2025 and 2026
Named customer set	Toshiba, RTX, L3Harris, and the U.S. Air Force were publicly named by February 2026	Logo quality is strong and cross-sector, improving trust in relevance to regulated buyers	high	Disclose total active customer count, top-customer share, and pilot-to-production conversion
Programs-of-record language	Salesforce Ventures said demand had pulled the company into programs of record across the Air Force and L3Harris	Suggests deeper embed than a casual pilot, but dollar value and duration are not public	medium	Provide funded backlog, contract duration, and whether work is mission-critical or exploratory
Defense-oriented hiring	Facility Security Officer and customer-facing deployment roles appear on the careers page	Implies government security and implementation overhead, which can raise CAC but also deepen moats	medium	Show time-to-clearance, implementation effort, and attach rates for support-heavy roles
Strategic investor mix	Salesforce Ventures, Accel, B Capital, RTX, Shield, J2, and others appear across rounds	Can improve enterprise access, procurement trust, and channel credibility	medium	Break down sourced pipeline from investor and partner introductions
Customer and investor overlap	RTX is both a named investor and a named customer in public materials	Raises independence and concentration questions around reference quality	medium	Quantify revenue from investor-linked customers versus fully independent customers

These are proxies, not substitute metrics. They help bound GTM quality when CAC, payback, win rate, and customer count are not public.

[CI010, CI014, CI015, CI016, CI019, CI037]

4.3 Cost structure and margin path: R&D-heavy today, with leverage only if delivery standardizes

The likely cost structure is dominated by specialized labor. Code Metal's public research record spans formal verification, legacy-code migration, HPC translation, and domain-specific models, while the careers page shows demand for compiler tooling, formal methods, ASIC design and verification, platform DevOps, modeling and simulation, and customer-facing engineering. That combination implies a company spending heavily on scarce researchers and senior technical staff, not a lightly supported consumption API. The company is also solving a problem that its own materials describe as difficult: manual migration across architectures can require rare kernel experts and weeks of tuning per kernel, while high-assurance environments need stronger guarantees than compile-and-test alone. That labor profile cuts two ways for margin. In the near term it suggests meaningful R&D expense and a forward-deployed burden, because customers in defense, semiconductor, automotive, and industrial systems often need custom hardware targets, toolchain constraints, and verification standards. Public materials do not disclose the split between implementation services and reusable product. That creates real risk that onboarding and delivery are still services-heavy, which would slow gross-margin expansion even if demand is strong. The upside case is equally visible from first principles. Code Metal's research stack suggests the company is trying to convert hard one-off migration problems into repeatable workflows, domain models, and proof-generation techniques. If that abstraction succeeds, the economic model could move from senior-engineer-heavy project work toward software-enabled repeatability with better gross margins and lower marginal delivery cost per program. Public evidence is not yet strong enough to prove that transition has happened, so the margin path should be described as plausible but unverified rather than assumed.[CI003, CI004, CI005, CI006, CI007, CI019]

Unit economics table
metric	value / null	confidence	why it matters	diligence ask
ARR		medium	No public ARR means contract visibility cannot be translated into recurring software revenue quality	Provide current ARR, split between software subscription and services
Gross margin		medium	Gross margin is the key test of whether verification and translation are becoming scalable software rather than labor-heavy delivery	Provide company-level and product-level gross margin with services separated
Burn rate		medium	Burn determines how much of the $125M raise is runway versus growth capital	Provide trailing 12-month operating burn and quarterly burn trend
Cash on hand		medium	Cash balance determines whether Code Metal can prove repeatability before needing another financing	Provide unrestricted cash, debt, and working-capital profile
Customer count		medium	A four-logo public list cannot reveal concentration, average contract size, or expansion breadth	Provide active-customer count, top-10 concentration, and cohort expansion
Sales cycle length	Likely long and high-touch; not publicly quantified	medium	Enterprise and government procurement friction shapes CAC and payback	Provide median time from first meeting to signed pilot and from pilot to production
Services share of revenue		medium	Project-heavy delivery can delay margin expansion even when demand is strong	Provide revenue and gross margin split among platform, services, and government programs
R&D labor intensity	High specialized-engineering burden implied by research stack and hiring mix	medium	Senior research and formal-methods talent can create moat but also raise fixed cost	Provide headcount by function and fully loaded compensation by major team
Potential software leverage	Plausible if workflows and models are reused across similar migration problems	low	This is the upside case for future margin expansion	Provide cohort evidence that later deployments require less custom engineering and deliver higher gross margin

Null means not publicly disclosed in retained sources. Non-null qualitative rows are first-principles judgments anchored in public hiring, research, and customer-context evidence rather than management financial reporting.

[CI019, CI020, CI021, CI027, CI029, CI031]

FI002: Unit economics bridge

Code Metal's likely unit-economics path starts with expensive specialist labor and customer-specific deployment, then improves only if translation and verification workflows become reusable software.

The flow expresses a first-principles margin hypothesis rather than measured unit economics. It intentionally highlights where public evidence ends.

[CI019, CI020, CI021, CI023, CI024, CI030]

4.4 Capital adequacy and financing dependency: Series B buys time, but not proof of repeatable revenue

Code Metal's financing cadence is the clearest public strength in the chapter. The company disclosed a $13 million seed plus a prior $3.45 million pre-seed in July 2024, a $36.5 million Series A at a $250 million valuation in November 2025, and a $125 million Series B at a $1.25 billion valuation in February 2026. That implies about $177.95 million of publicly visible equity financing in less than three years from founding, and the SEC EDGAR results page shows Form D notices in 2023, 2024, twice in 2025, and 2026. The compressed jump from Series A to Series B in only a few months suggests investors saw enough contract momentum and market urgency to fund ahead of full public metric transparency. The challenge is that fundraising velocity is not the same as capital adequacy proof. Public sources do not disclose current cash on hand, monthly burn, runway, debt, or working-capital needs. The Series B use of funds—more engineering, product development, commercial and government partnerships, and GTM scale-up—reads like a company still investing aggressively rather than harvesting mature software margins. That means the $125 million raise likely improved near-term runway materially, but it does not eliminate dependence on showing that contract wins can become repeatable, scalable revenue rather than a sequence of expensive custom programs. Strategic-capital composition also matters. The syndicate can clearly help with customer access and market signaling, but it introduces overlap risk when an investor such as RTX also appears in the public customer set. In other words, Code Metal looks well financed for the next operating phase, but future financing terms will probably depend less on the novelty of the verification narrative and more on whether management can show durable conversion from high-touch translation projects into a more repeatable software-and-platform business.[CI011, CI012, CI013, CI014, CI015, CI033]

Capital adequacy table
capital item	public value / status	confidence	why it matters	diligence ask
Latest raise	Series B: $125M led by Salesforce Ventures	high	Provides the clearest public runway-strength signal entering 2026	Confirm net proceeds, close date, and any secondary component
Prior major raise	Series A: $36.5M at $250M valuation	high	Anchors the speed of the later step-up into the billion-dollar round	Provide Series A use of funds and conversion into signed revenue
Disclosed financing total	~$177.95M across pre-seed, seed, Series A, and Series B	medium	Shows substantial public capital access for a company founded in 2023	Confirm lifetime raised including any debt, grants, or undisclosed extension capital
Cash on hand		medium	Public raise amounts do not reveal how much cash remains after hiring and delivery spend	Provide current unrestricted cash and quarterly net cash burn
Monthly burn		medium	Runway cannot be inferred confidently without spend data	Provide monthly burn bridge with hiring, infra, and services-delivery components
Runway months		medium	The underwriting question is whether the company can prove repeatable revenue before the next financing trigger	Provide base-case runway and management trigger points for fundraising
Planned use of funds	Engineering capacity, product development, commercial and government partnerships, and GTM scale-up	high	Signals growth investment rather than a fully optimized margin structure	Provide 24-month budget by function and milestone
Next-round trigger	Not public; likely tied to proving repeatable revenue and product leverage beyond bespoke delivery	medium	Determines dilution risk and how much time investors are buying with the Series B	Provide board-approved milestones for next financing or cash-flow break-even
Debt / project-finance obligations	No debt or project-finance obligations disclosed in retained sources	medium	Debt could materially change cash risk and covenant flexibility	Provide debt schedule, guarantees, and any customer prepayment obligations
Filing cadence	SEC EDGAR shows Form D notices across 2023, 2024, 2025, and 2026	high	Corroborates a rapid financing cadence even where detailed terms are private	Provide the mapping between each Form D notice and the commercial milestones it funded

This table focuses on forward capital adequacy rather than retelling the full chronology. Null means the retained public record does not disclose the metric.

[CI011, CI012, CI013, CI033, CI034, CI035]

FI003: Financial estimate range

The most defensible public financial range is the capital and valuation step-up itself: disclosed financing accelerated sharply between 2024 and 2026, but operating metrics remain undisclosed.

This figure stays close to disclosed capital data because public burn, runway, and margin metrics do not exist in retained evidence. The only range item is the valuation step-up and elapsed months between rounds.

[CI011, CI012, CI013, CI033, CI034, CI035]

4.5 Evidence gaps and financial verdict: analytically positive, still too opaque for clean underwriting

The chapter's central evidence gap is straightforward: Code Metal has more public demand proof than public financial proof. The retained sources are good enough to support a bullish narrative on value proposition, contract relevance, customer quality, and investor appetite, but they stop short of the metrics required for conventional underwriting. There is no public ARR, no customer-count denominator behind the named logos, no disclosed gross margin, no burn, no cash balance, no contract-value detail, and no revenue split between platform, professional services, and government program work. That makes it impossible to tell from public evidence whether the company is already converging toward a scalable software model or still monetizing mostly through bespoke translation engagements. Adverse evidence does not disprove the business, but it does constrain confidence. Wired explicitly noted that methodologies in AI code tooling remain unproven and that investors are gambling that some picks-and-shovels vendors will work. Public customer proof also remains concentrated in a short disclosed logo list, while one of those logos—RTX—sits on both the customer and investor side. Programs-of-record language is encouraging, but no retained source quantifies backlog, award dollars, or win rates. The company therefore deserves credit for credible traction signals, but not for financial metrics it has not disclosed. The evidence-constrained verdict is that Code Metal likely has real early revenue, real enterprise and government demand, and real runway support after the $125 million Series B. The margin path could become attractive if translation, verification, and hardware-portability workflows are turning into reusable software rather than one-off engineering programs. But that upside remains a diligence hypothesis, not a public fact. Until management provides revenue mix, realized pricing, gross margin, burn, cash, and cohort-level customer data, the prudent financial stance is analytically constructive but underwriting-incomplete.[CI017, CI018, CI027, CI028, CI029, CI032]

Public financial gaps table
missing private metric or proof	impact on judgment	exact diligence path
ARR and recurring-software revenue split	Without this, public contract claims cannot be translated into repeatable software quality	Request monthly recurring revenue bridge split among platform, services, and government program revenue
Gross margin by product and services	Margin path is the main test of whether Code Metal is becoming software-led	Request gross margin by product line, services line, and cohort vintage
Burn and current cash balance	Runway cannot be underwritten from raise size alone	Request latest balance sheet, 13-week cash forecast, and trailing 12-month cash flow statement
Realized pricing and contract economics	No public source discloses ACV, discounts, usage assumptions, or verification uplift	Review three representative customer contracts with pricing schedules and renewal terms
Customer concentration and cohort expansion	A short public logo list may hide concentration risk or weak pilot-to-platform conversion	Request top-10 customer concentration, logo retention, and expansion by cohort
Services-to-platform conversion	The biggest business-model risk is staying stuck in labor-heavy custom migration work	Request revenue mix by delivery phase and evidence that repeat deployments need less custom engineering
Government backlog and award value	Programs-of-record language is encouraging but not financially quantifiable	Request funded backlog, remaining performance obligations, and contract vehicle detail
Debt, guarantees, and working-capital obligations	Undisclosed financing structure can change true runway and downside risk	Request debt schedule, covenants, guarantees, and any customer prepayment or milestone-payment obligations

These gaps are the specific blockers preventing a cleaner public-only underwriting case. Every item has a concrete diligence path rather than a generic request for more data.

[CI018, CI029, CI032, CI039, CI040, CI041]

Chapter 05

05Product & Technology

5.1 Product Workflow in Customer Terms

Code Metal's public product surface is unusually specific about workflow. The company does not position itself as a generic code assistant that writes snippets on demand. Instead, the product page asks an engineer to start from an existing high-level codebase, load that code through a Code Metal IDE plugin, let the system track module and library dependencies, and then define the runtime environment that the output must satisfy. Only after the user specifies the CPU, accelerator mix, resource constraints, and preferred toolchains does the system generate a transpilation and deployment plan. That sequencing matters because the product is really a migration-and-portability workflow, not only a model. Translation, verification, optimization, build targeting, and change tracking are presented as one loop for teams that cannot accept silent regressions. Official and investor materials keep tying this workflow to defense, aerospace, semiconductor, automotive, and other regulated settings where correctness, compliance, and hardware portability matter as much as raw coding speed.[CE001, CE002, CE003, CE004, CE005, CE006]

Product Module / Asset Matrix
Module / asset	Primary user	Public status / maturity	Differentiation	Diligence gap
IDE plugin and intake surface	Application or modeling engineer	Explicitly named on the product page	Keeps high-level source and dependency context inside a familiar IDE loop	No public screenshots or supported-IDE matrix
Runtime configuration and translation planner	Platform or embedded engineer	Explicitly named on the product page	Lets the buyer specify hardware resources and toolchains before generation	No public config-file format or orchestration API is disclosed
Translation engine	Compiler or software engineer	Publicly described but mostly through workflow copy	Combines cross-language translation with hardware-aware target selection	No public end-to-end success-rate benchmark on arbitrary codebases
Verification layer	Safety or mission-software lead	Core official differentiator	Proof-first posture is the main separator from generic coding AI	Proof coverage granularity and fallback behavior are not publicly quantified
Optimization layer	Performance engineer	Explicitly named on the product page and benchmark posts	Optimizes for runtime memory code size and power rather than only functional translation	Published results are selective and still described as preliminary
Deployment backends	Build or release engineer	Explicitly named output families	CPU GPU and FPGA backends appear in one workflow rather than separate products	No public packaging CI templates or supported build matrix
Research and benchmark stack	Research or compiler team	High public research cadence	LLMLift UniPar gpuFLOPBench and MonoCoder show active iteration on the core thesis	Promotion path from research artifacts to GA product remains opaque
Delivery and support layer	Customer platform team	Inferred from hiring and customer claims	Forward-deployed DevOps and security roles imply hands-on implementation support	No public SLA support tiers or reference architecture pack

Statuses reflect what is explicitly public as of 2026-06-13; several rows are maturity inferences from product research and hiring evidence rather than from formal documentation.

[CE003, CE004, CE005, CE006, CE007, CE008]

Workflow / Use-Case Table
User job	Current workflow pain	Code Metal solution	Evidence-backed benefit	Limitation
Port high-level algorithms to edge hardware	Teams otherwise rewrite Python Matlab or Julia logic by hand for target hardware	Plugin intake plus runtime-specific transpilation plan	Public workflow is concrete about source languages target definition and code generation sequence	No public customer case study walks through the full cycle end to end
Retarget CUDA workloads off NVIDIA	Manual kernel porting and tuning is slow and architecture-specific	Translate CUDA kernels to alternative GPU or accelerator targets while preserving correctness	Adreno and Hexagon examples show at least one non-NVIDIA portability path with validated outputs	Benchmarks are early and selective not a broad workload catalog
Convert MATLAB or signal-processing logic to HDL or FPGA	MATLAB-to-HDL workflows are brittle and syntax-repair heavy	MATLAB-to-HDL research plus FPGA output claims point to a structured translation path	Workflow-vs-agents research shows the company is iterating on this specific conversion problem	No public production deployment example is disclosed for the HDL path
Modernize legacy C C++ or Java to newer targets	Manual rewrites are risky in regulated systems	LLMLift and formal-verification messaging frame modernization as behavior-preserving translation	Research explicitly ties translation to newer runtimes such as Rust	Public docs do not specify supported legacy-language coverage exhaustively
Explore performance variants under hard resource limits	Portability alone does not guarantee deployable performance	Optimization layer offers memory runtime code-size and power tradeoffs	Portability benchmarks report some workloads at or above baseline performance	No public scheduler autotuning policy or cost model is described
Maintain generated code after human review	Generated code often drifts away from source intent once manually edited	Change-tracking loop suggests matching edits and annotations back to the input code	This can reduce rewrite debt if it works as advertised	No public version-control integration list or audit-trail artifact is shown

Benefits are separated into explicit measurements versus workflow claims; only the CUDA-portability row has a public performance-style benchmark in retained evidence.

[CE003, CE005, CE007, CE008, CE009, CE010]

FE002: Customer Workflow / Operating Flow

The public workflow runs from high-level source intake through runtime definition generation verification deployment and human change tracking.

This is a direct synthesis of the product-page sequence; public sources do not expose conditional branches such as proof failures solver timeouts or manual review gates.

[CE003, CE004, CE005, CE007, CE008, CE010]

5.2 Architecture, Lineage, and Operating Model

Public materials support a layered architecture even though Code Metal has not published a full system design. The visible layers are developer intake in the IDE, runtime and dependency analysis, a translation planner, generation backends for CPU, GPU, and FPGA outputs, a verification layer that claims functional-equivalence or safety checks, and an optimization layer that trades among runtime, memory, code size, and power. Series B and Salesforce Ventures materials explicitly call the stack neuro-symbolic, which is consistent with the seed-stage description of formal-methods-based analysis combined with custom coding models. The research lineage underneath that posture is unusually explicit for a startup: current product messaging ties to LLMLift, while company research also admits that earlier verified-lifting work such as Tenspiler exposed scaling and manual-rule limits. Independent technical pages for Metalift, Alvin Cheung, and Loris D'Antoni reinforce that Code Metal is drawing from real program-synthesis and formal-specification traditions rather than relying only on frontier-model marketing. The upside is a differentiated trust story; the caution is that solver constraints, specification design, and workload scoping probably remain material parts of real deployments.[CE012, CE013, CE014, CE016, CE023, CE024]

Technology / Operating Architecture Table
Layer / component	Public role	Key dependency	Main risk
Developer intake and context layer	Collects source code dependencies and local workflow context through the IDE plugin	Customer codebase plus developer environment	Supported IDEs repo shapes and dependency boundaries are not publicly documented
Runtime and deployment planner	Converts hardware resource and toolchain choices into a plan for generation and build	Target hardware metadata and build-tool integration	Unknown how much target modeling is automated versus manually curated
Translation layer	Generates target-language or target-backend code from high-level source	Custom coding models plus task-specific prompting and repair flows	Pure LLM generation is brittle on harder code-reasoning tasks so orchestration quality matters
Verification layer	Attempts to prove functional equivalence or absence of selected safety violations	Specifications proof generation and solver backends	Public sources do not quantify proof coverage or failure rates across production workloads
Optimization layer	Explores alternative implementations for memory runtime code size or power	Backend-specific heuristics and benchmark loops	Performance portability remains workload-specific and may require further tuning
Research substrate	Draws on LLMLift Tenspiler Metalift UniPar MonoCoder and related benchmark work	Program synthesis LLVM analysis Rosette CVC5 and domain-focused models	Research-to-product transfer path is visible in theme but not in concrete release notes
Support and integration layer	Provides CI/CD on-prem and customer-delivery surfaces through platform and forward-deployed roles	DevOps solutions architecture and security operations	The public surface suggests services-heavy onboarding rather than a purely self-serve platform
Governance and assurance context	Maps the product into defense-grade and critical-infrastructure assurance expectations	Formal specs proof evidence and AI risk controls	Product claims may outrun what current public governance artifacts actually expose

This architecture is reconstructed from public product copy research pages investor commentary and hiring rather than from a vendor-published system-design document.

[CE005, CE007, CE009, CE012, CE014, CE016]

FE001: Product Architecture Map

Publicly reconstructable layers from developer intake through proof optimization and hardware-target delivery.

Layer boundaries are reconstructed from product copy research artifacts and hiring. Code Metal has not published a full control-plane or system-design document.

[CE003, CE004, CE005, CE007, CE008, CE009]

5.3 Deployment, Integration, Support, and Maturity

Code Metal's strongest public maturity signals are around target breadth and implementation seriousness, not around polished self-serve packaging. The product page explicitly names CPUs, GPUs, FPGAs, and multiple toolchains, while the NVIDIA-portability research gives one concrete proof point that the portability story is more than aspirational: it describes validated translation from CUDA to OpenCL on Qualcomm Adreno GPUs and from serial CPU kernels to Hexagon Vector Extension NPUs, with some workloads meeting or exceeding baseline performance. Hiring fills in the operating model. Platform DevOps roles for CI/CD and cloud-plus-on-prem, a Solutions Architect, a Forward Deployed Engineer, and a Facility Security Officer suggest hands-on integration and government-adjacent delivery rather than a pure SaaS motion. Research output spans verified transpilation, workflow orchestration, HPC translation, specialized code models, and benchmarking, while official news pages show a steady cadence of funding and media moments tied to product development. The limiting factor is disclosure depth: there is still no public API reference, supported-runtime matrix, or support-SLA documentation comparable to a mature developer platform.[CE006, CE008, CE009, CE017, CE018, CE019]

Roadmap / Release / Development-Stage Table
Date / stage	Milestone	Status	Product implication	Source
2024-07 seed stage	Seed announcement says Code Metal is building modular and verifiable agentic workflows for the edge	Completed announcement	Shows the product thesis started as workflow orchestration plus formal methods not as a generic coding assistant	Seed announcement
2024 research lineage	LLMLift and verified-transpilation work become part of the public technical story	Public technical direction	Signals that proof-backed transpilation is not just marketing copy but an active research program	Verified transpilation pages and arXiv HTML
2024-2025 academic carryover	Tenspiler and Metalift remain visible antecedents in the company architecture story	Historical lineage still relevant	Suggests core abstractions and solver ideas predate the current company surface	Tenspiler and Metalift pages
2025 research breadth	Public research expands into workflow orchestration HPC translation and domain-specific code models	Active experimentation	Roadmap is broadening from one transpilation claim into surrounding reliability and model-layer questions	Workflow-vs-agents UniPar and MonoCoder
2025 Series A	Series A messaging says the platform is already used in defense automotive and semiconductor settings	Public commercialization milestone	Signals move from research posture toward live mission-critical delivery	Series A official page
2026 portability benchmarks	CUDA portability and validated kernel benchmarks become public	Public benchmark milestone	Adds evidence that hardware portability is more than conceptual	The real cost of leaving NVIDIA
2026 Series B	Capital is earmarked for engineering capacity product development and partnerships	Current growth phase	Implies more implementation support and target-market expansion rather than a frozen product surface	Series B official page and Business Wire
2026 hiring and media cadence	DevOps formal methods FDE FSO CNBC Wired and TBPN surfaces remain active	Current operational signal	Suggests roadmap execution is happening in parallel with external market education	Careers page and news index

This is a roadmap-signals table, not a formal release-note changelog. It uses public milestones to infer where the product appears to be maturing.

[CE013, CE015, CE017, CE020, CE025, CE027]

FE003: Critical Dependency Map

The delivery stack depends on target hardware metadata proof infrastructure customer environments and implementation support rather than on the model alone.

This map intentionally mixes technical and delivery dependencies because the public hiring surface suggests the product is not yet a purely self-serve workflow.

[CE005, CE006, CE016, CE030, CE033, CE039]

FE004: Product Maturity / Capability Map

Public-evidence-based view of which layers look mature today and which still rely on inference or private diligence.

Maturity labels reflect public evidence quality as of the run date, not internal shipping status. Several rows would need private diligence artifacts to move higher with confidence.

[CE012, CE016, CE036, CE038, CE039, CE040]

5.4 Trust, Safety, Limits, and Diligence Priority

The trust case is real but narrower than the headline suggests. Official and investor materials repeatedly stress mathematical proof, validation, compliance, and production-readiness, and the company's formal-methods explainer usefully clarifies why that matters: testing alone cannot establish the absence of bugs. Still, the public proof boundary is not fully specified. The retained sources support claims about behavioral preservation for selected translations and about validated kernels in benchmark settings, but they do not publish a proof-coverage ratio, a failure-mode taxonomy for solver timeouts, or a clear explanation of which properties are always proven versus which are only tested heuristically. The research history also provides adverse evidence. Tenspiler's complexity sensitivity and manual rule-building burden show why purely symbolic methods do not scale automatically. UniPar and gpuFLOPBench show that LLM-driven code reasoning remains brittle on parallel and mathematically messy workloads, which is exactly why Code Metal needs structured workflows and proof machinery in the first place. Finally, the privacy policy is the only explicit public security control surfaced here; it promises reasonable measures but no absolute guarantee, and the reviewed pages do not disclose SOC 2, ISO 27001, FedRAMP, or a public status page.[CE021, CE022, CE023, CE024, CE027, CE028]

Trust / Quality / Compliance Table
Control or risk area	Public signal	Status	Scope	Remaining gap
Formal equivalence or safety proof	Official and investor pages repeatedly claim mathematical proof or verified outputs	Strong marketing claim	Applies to translation correctness and selected safety properties not obviously to all deployment behavior	No public proof-coverage metric or always-proven property list
Testing versus proof distinction	Formal-methods explainer explicitly says tests cannot prove bug absence	Explicitly disclosed	Clarifies why Code Metal positions itself above generic code generation	No public evidence of how proof and testing are combined in customer pipelines
Portability benchmark validation	NVIDIA-portability post says generated kernels were validated as correct	Concrete but narrow	Shows at least one measured path from translation to validated performance	Workload coverage is selective and still called preliminary
LLM reliability measurement	UniPar gpuFLOPBench and workflow-orchestration research all study model failure patterns	Meaningful research surface	Supports the need for structured workflows and repair rather than one-shot prompting	No public statement on pass rates for production customer codebases
Privacy and data handling	Website privacy policy describes data collection rights and reasonable security measures	Basic control disclosed	Covers website data practices and a security disclaimer	No public data-residency tenant-isolation or training-data-use statement for the product itself
Security certifications and service assurance	Retained public pages do not name SOC 2 ISO 27001 FedRAMP or a public status page	Not disclosed	Absence itself is the observable signal	Enterprise and defense buyers will likely ask for private trust-center materials
Defense-grade assurance alignment	DARPA and NIST materials provide the external assurance context for mission-critical AI software	Context only	Supports why formal methods and assurance evidence matter in this market	Does not prove Code Metal has met a given certification or assurance threshold
Human or solver fallback	Tenspiler limits and benchmark caveats imply manual expertise and solver constraints remain part of delivery reality	Adverse evidence exists	Important for scoping real deployment risk and staffing requirements	No public fallback taxonomy timeout policy or human-review playbook

The table separates explicit controls from adjacent context and disclosure gaps. Absence claims are limited to the retained public pages reviewed for this chapter.

[CE016, CE021, CE022, CE023, CE027, CE028]

5.5 Exhibits

Chapter 06

06Customers

6.1 Customer map: public evidence points to four real buyer buckets, plus one still-unnamed semiconductor class

Code Metal's public customer story is not one monolithic enterprise bucket. The retained sources support at least five distinct demand surfaces: defense primes such as L3Harris and RTX; direct government buyers or programs centered on the U.S. Air Force; semiconductor or platform teams that need code portability across chips and toolchains; industrial or electronics accounts represented most clearly by Toshiba; and earlier logistics or edge operators represented by X-Press Feeders and HICO-linked testimony. The homepage and product pages reinforce why these segments cluster around the company: buyers care about verified translation, target-hardware configuration, and production readiness in environments where software failure has safety, compliance, or mission consequences. That buyer mix also implies a non-self-serve motion. Careers openings for a Facility Security Officer, Forward Deployed Engineer, Principal Solutions Architect, and on-prem or cloud platform roles are far more consistent with long, integration-heavy selling than with low-friction PLG adoption. The practical read is that segmentation is visible, but the book of business still appears narrow and specialized rather than broadly diversified.[CU001, CU002, CU003, CU004, CU005, CU007]

Customer segmentation table
Segment	Buyer / user / payer	Use case	Scale / strategic value	Revenue or strategic read	Gap
Defense primes	Prime contractor engineering or mission-software team buys; developers and program leads use; program budget pays	Porting, validating, and optimizing code for mission systems or edge hardware	Named proof: L3Harris and RTX	Highest strategic credibility because defense programs are reference-rich if they convert	Direct customer quotes, contract size, and prime vs subcontract status are not public
Government buyers / programs	Government program office or Air Force-linked buyer pays; operators and software teams use	Modernization and verification in mission-critical environments	Named proof: U.S. Air Force plus programs-of-record language	Potentially large ACVs and long life cycles if system-of-record placement is real	No public award IDs, program numbers, or budget lines were retained
Semiconductor / platform teams	Platform, compiler, or enablement team would buy; internal developers use; engineering budget pays	Port code across chip platforms, accelerators, and toolchains	Public evidence shows semiconductor deployments plus talks with a large unnamed chip company	Could become sticky because portability and verification tie into target hardware choices	No named chip customer or independent semiconductor case study is public
Industrial / electronics enterprises	Industrial software team or electronics OEM pays; engineering teams use	Move production code across runtimes while preserving safety and performance	Named proof is concentrated in Toshiba and generalized industrial positioning	Useful for non-defense diversification and commercial credibility	No outcome metric, scope detail, or testimonial beyond naming the logo
Logistics / edge operators	Operator or incubator-backed deployment owner pays; operations and product teams use	Edge-development workflows and intelligence in logistics network	Older proof from X-Press Feeders and HICO-linked testimonial	Shows the company could serve real industrial workflows before the later defense-heavy narrative	It is an early 2024 proof point and may not map cleanly to the later code-translation core motion

Rows separate who pays, who uses, and what the public proof actually covers so logos are not mistaken for homogeneous customer economics.

[CU001, CU002, CU004, CU005, CU007, CU009]

Customer evidence quality table
Proof surface	Example	Source basis	What it proves	What it does not prove	Current read
Official named-customer release	Series B release naming Toshiba, RTX, L3Harris, and U.S. Air Force	Code Metal plus Business Wire	The company is willing to attach its name to specific accounts	Does not prove renewal, contract size, or production depth	Useful baseline proof, not enough for durability underwriting
Investor thesis / portfolio post	Salesforce Ventures and B Capital customer writeups	Investor / partner sources	Sophisticated backers see real adoption and strategic relevance	Investors are not disinterested customer witnesses	Helpful corroboration but still aligned with financing narrative
Customer-quoted proof	HICO-linked quote about logistics network transformation	Seed-stage company release	At least one outside user-style voice exists in the public pack	It is older, not clearly representative of current core defense accounts	Best direct testimonial, but thin and stale relative to 2026 scale claims
Independent press	Wired and CNBC reporting on customers, pricing, and account mix	Independent news	Third-party reporters could verify enough to print named customer set and pricing commentary	Press still depends heavily on management interviews and does not publish renewal data	Good corroboration for existence, weak for account economics
Official web surface	Homepage, product, research, and media-amplification posts	Company website	The company openly markets regulated, hardware-specific use cases	The site does not offer a deep customer-story library or review surface	Evidence surface is more product- and fundraising-led than case-study-led
Public procurement / operating record	USAspending and SAM search pages did not surface a usable award file or direct program documentation in the retained pack	Absence across retained public sources plus generic federal search surfaces	Shows where diligence still needs primary documents	Does not disprove customer activity, only limits external verification	Largest remaining gap for government-proof quality

Rows grade the quality of public customer proof, not the quality of the customers themselves.

[CU023, CU024, CU029, CU030, CU033, CU034]

FU001: Customer journey map

The implied path starts with a painful legacy-code or chip-portability problem, then moves through a scoped verified deployment before any durable expansion can happen.

This is an evidence-led journey map inferred from public sources, not a disclosed internal sales process or actual funnel conversion report.

[CU002, CU003, CU017, CU018, CU019, CU026]

6.2 Named proof: the logos are real, but most public evidence still comes from company and investor channels

The chapter's strongest positive signal is that Code Metal does publicly name real accounts rather than relying only on anonymous enterprise references. The February 2026 Series B release and Business Wire version both name Toshiba, RTX, L3Harris, and the U.S. Air Force, while Salesforce Ventures says demand has already pulled the company into programs of record across the Air Force and L3Harris. The November 2025 Series A release adds a more concrete L3Harris performance datapoint, with Shield Capital saying Code Metal sped code translation from weeks to days across several projects. The July 2024 seed announcement adds two older but still relevant proof surfaces: strategic partnerships with L3Harris and X-Press Feeders, plus a direct testimonial from HICO's Chris Hartnoll about Code Metal changing how intelligence is built into a logistics network. That said, proof quality is uneven. RTX is both investor and named customer, which weakens its value as a clean third-party reference. Toshiba is named, but no public user quote, case study, or outcome metric was found. For the Air Force, the public pack supports interest and likely operational relevance, but not contract IDs, award dollars, or direct program documentation; retained USAspending and SAM search pages did not by themselves expose an award trail. Additional J2 Ventures and Shield Capital surfaces reinforce that many accessible third-party channels still sit inside the backer ecosystem rather than inside customer-authored documentation.[CU005, CU006, CU007, CU008, CU009, CU010]

Customer growth / adoption trajectory table
Metric / proxy	Value	Date	Source	Implication	Missing denominator
Early named partnership proof	X-Press Feeders and L3Harris named as strategic partnerships while company said it was already generating revenue	2024-07	Code Metal seed announcement	There was customer traction before the later venture step-up	No contract values or customer count disclosed
Deployment across key sectors	Platform described as deployed across defense, automotive, and semiconductor industries	2025-11	Code Metal Series A release	Suggests adoption moved beyond a single defense niche	No account count or deployment count by sector
Revenue-scale commentary	Company said it was already on contract to deliver eight figures in revenue that year	2025-11	Code Metal Series A release plus CNBC coverage	Meaningful demand signal for a young company	Not broken out by recurring vs services or by customer
L3Harris project depth	Shield Capital said Code Metal cut translation from weeks to days across several L3Harris projects	2025-11	Code Metal Series A release	Best public outcome detail for a named defense-prime account	No customer-authored confirmation or dollar value
Named customer set at Series B	Toshiba, RTX, L3Harris, and the U.S. Air Force publicly named as customers	2026-02	Code Metal / Business Wire	Official logo set remained consistent and referenceable by early 2026	No denominator for total active accounts
Programs-of-record commentary	Salesforce Ventures said demand pulled Code Metal into programs of record across the U.S. Air Force and L3Harris	2026-02	Business Wire and Salesforce Ventures	Suggests movement beyond pure pilot language	No public procurement record in the pack verifies exact program scope
Support-motion proxy	FSO, FDE, Principal Solutions Architect, and COO scale-up were all public	2026 current	Careers page and GeekWire	Customer motion likely needs hands-on security, solutions, and operational support	No disclosed ratio of support staff to active customers

Trajectory rows use public milestones and operating proxies, not internal CRM funnel or cohort data.

[CU003, CU005, CU006, CU009, CU011, CU012]

Named customer proof table
Customer	Segment	Deployment / use case	Production vs pilot	Outcome / proof	Limitation
U.S. Air Force	Government buyer / program	Mission-critical code translation and verification; investor says programs of record	Production-like use claimed, but exact stage undisclosed	Named in official Series B materials and repeated by Salesforce Ventures	USAspending and SAM search surfaces in the pack do not themselves reveal a contract ID, award amount, or user testimonial
L3Harris	Defense prime	Multiple code-translation projects and programs-of-record narrative	At least multi-project reference use; production depth still not independently verified	Shield quote says translation sped from weeks to days across several projects	No direct customer-authored case study or contract value
RTX	Defense prime and investor overlap	Mission-software and hardware-optimization use cases implied by company materials	Customer status claimed; production depth unclear	Named repeatedly in official and investor materials	Reference quality is weakened because RTX is also an investor
Toshiba	Industrial / electronics account	Move code between languages and optimize for hardware	Customer status claimed; stage not disclosed	Named in official Series B materials and investor posts	No public outcome metric, timeline, or testimonial
X-Press Feeders / HICO logistics network	Industrial logistics / edge deployment	Designing, building, and delivering intelligence into a logistics network	Reference deployment or partnership proof from early commercialization	Includes the clearest public user-style quote in the pack	Older proof point tied to edge/logistics narrative, not later defense-heavy positioning

This table exhaustively enumerates named external customer or customer-quoted proof surfaces visible in the prepared customer pack; it excludes unnamed chip prospects and generic industry claims.

[CU005, CU006, CU009, CU010, CU012, CU028]

FU002: Adoption / deployment funnel

The public record narrows quickly from a handful of named logos to almost no disclosed durability metrics.

Counts summarize only the prepared source pack for this chapter and deliberately separate named proof from durable-economics disclosure.

[CU005, CU010, CU012, CU025, CU028, CU030]

6.3 Durability: likely sticky if deployments land, but public retention evidence is still mostly absent

Public evidence is directionally positive on durability but weak on direct measurement. The favorable side is structural: Code Metal sells into accounts that appear to care about hardware-specific deployment, formal verification, and regulated or mission-critical outcomes. If a customer has already configured translation plans around its CPUs, GPUs, FPGAs, toolchains, or security constraints, then switching away should be expensive in time, engineering attention, and re-validation. Independent technical and policy sources help explain why. The LLMLift and Tenspiler work, along with DARPA and NIST material, all reinforce that translation into specialized runtimes or high-assurance environments is hard, bug-prone, and validation-heavy. That creates the possibility of long-lived, high-value accounts if the first deployment works. But the public record does not let an investor jump from plausible stickiness to proven retention. There is no public NRR, GRR, churn, contract-length, renewal, expansion-rate, or cohort disclosure. The best available proxies are weaker: L3Harris is described as several projects, the Air Force and L3Harris are said to have programs of record, and a low-tier secondary piece relays management's claim that every deployed pilot moves to the next phase. Those are encouraging signals, not retention accounting.[CU018, CU019, CU020, CU021, CU022, CU025]

Retention / repeat usage / satisfaction table
Metric / proxy	Value	Segment	Confidence	Diligence ask
Formal NRR / GRR / churn		All segments	High that it is undisclosed	Request retention cohorts, logo churn, and gross retention by segment
Renewal rate or contract term		All segments	High that it is undisclosed	Request median contract length, renewal timing, and cancellation terms
L3Harris repeat-use proxy	Several projects; weeks-to-days translation claim	Defense prime	Medium	Request project count, dates, ACV, and whether the work rolled into a recurring platform relationship
Programs-of-record proxy	Salesforce Ventures says U.S. Air Force and L3Harris programs of record	Government / defense	Medium	Request program IDs, award value, and whether Code Metal is prime, subcontractor, or software component supplier
Pilot-conversion claim	Management claim relayed in secondary coverage that every deployed pilot goes to the next phase	Cross-segment	Low	Request customer-by-customer pilot-to-production conversion data
Public testimonial depth	One clear customer-style quote in the retained pack, from HICO-linked logistics context	Industrial / logistics	Medium	Request three current customer references with use case, KPI, and renewal status

Null means no public metric was found; proxy rows are weaker than true retention disclosure and should not be treated as SaaS-style cohort evidence.

[CU010, CU025, CU028, CU030, CU033, CU035]

FU003: Customer proof matrix

Proof quality is strongest on logo existence and weakest on independent retention visibility.

Grades reflect public proof quality, not customer quality. Low revenue visibility means the disclosure is absent, not that the account is unimportant.

[CU023, CU028, CU029, CU030, CU031, CU032]

6.4 Reference risk: few logos, heavy defense mix, and thin third-party validation keep concentration risk high

The adverse case is not that customer demand is fake; it is that the visible proof is narrow, defense-heavy, and mediated by interested parties. The same four names—U.S. Air Force, L3Harris, RTX, and Toshiba—recur across official releases, investor theses, and recycled secondary coverage, while independent customer testimonials remain scarce. Code Metal's own public surface is dominated by funding announcements, research essays, and media-amplification posts rather than a mature customer-story library. That matters because logos alone do not answer the underwriter's key questions: how concentrated revenue is, whether the company sits on prime or subcontract paper, how renewals behave, and whether the customer relationship survives the initial migration project. Procurement friction is also likely material. Government and defense customers often require security handling, formal assurance, and long buying cycles, and the open roles plus new COO hire fit that interpretation. Retained USAspending and SAM search pages add official context but still do not surface the contract IDs, entity trail, or award dollars needed to verify whether Code Metal holds direct federal paper. Investor-customer overlap further muddies reference quality: RTX is a funder and a named customer, while J2 Ventures and Shield Capital add more partner-owned surface area to the external narrative without supplying customer-authored proof. The net conclusion is constructive on relevance but cautious on durability. Public evidence proves that Code Metal has landed credible reference accounts; it does not yet prove diversified, independently verified, renewal-rich customer economics.[CU023, CU024, CU025, CU026, CU031, CU034]

Expansion and concentration risk table
Expansion driver	Concentration or friction risk	Impact	Diligence path
Government programs of record	Procurement cycles may be slow and security-heavy, and retained USAspending/SAM searches do not themselves verify the award trail	Can create large durable accounts, but timing and cash conversion may be uneven	Obtain award history, buying authority, contract identifiers, and security/compliance steps by account
Defense-prime modernization	Few visible logos mean one or two accounts may dominate public proof	High revenue concentration risk if prime accounts are still a large share of bookings	Request top-5 customer mix and pipeline by prime
Semiconductor portability workflows	Named chip customer is still absent from the public pack	Good strategic adjacency, but conversion is unproven	Request name, stage, and deployment KPI for the unnamed chip-platform account
Industrial / electronics accounts	Toshiba broadens the story beyond defense, but only one public electronics logo is named	Commercial diversification may be thinner than headline vertical list implies	Request additional named industrial customers and proof of repeat deployments
Hardware-specific integration and verification	Custom runtimes and toolchains can create sticky deployments once landed	Potential upside to retention and expansion if first deployment succeeds	Request expansion revenue from follow-on modules, hardware targets, or business units
Investor-customer overlap and PR-mediated proof	RTX and investor theses help access but weaken independence of the reference set; J2 and Shield add more partner-owned narrative surface	Raises risk of overstating unbiased customer validation	Request direct customer references that are not also investors or financial sponsors

This table separates expansion upside from the evidence gaps that prevent public proof of diversified customer economics.

[CU006, CU015, CU026, CU027, CU031, CU037]

6.5 Exhibits

Chapter 07

07Risks

7.1 Risk priority and adverse stance

Code Metal is easier to like than to underwrite. The public record supports a company solving real modernization pain in environments where failure matters, and it supports unusually strong financing velocity for such a young business. The adverse view is that this same profile concentrates risk: trust, certification, and deployment proof matter more here than in ordinary developer tooling, while the company has disclosed far less operating evidence than the $1.25 billion valuation implies. The most material risks therefore sit above ordinary startup execution noise. The company sells into mission-critical settings, names only a short customer list publicly, and relies on a story of verifiable translation whose public validation still comes mainly from papers, investor theses, and company materials rather than from field reliability metrics. That means the biggest risk is not that no one cares, but that buyers care deeply and still require more proof, more services, and more approval work than the current valuation discounts. This chapter therefore takes the strongest adverse stance in the report: assume demand exists, then ask whether it compounds into repeatable software economics under real regulatory, procurement, and delivery constraints. If diligence cannot close those proof gaps, the financing story is ahead of the operating story.[CR001, CR002, CR003, CR004, CR005, CR006]

FR001: Code Metal risk heatmap

Heatmap of the highest-priority Code Metal risks, showing where residual exposure remains high even after visible mitigations.

Likelihood, impact, and mitigation maturity are qualitative underwriting judgments based on retained public evidence as of 2026-06-13. Residual exposure expresses what remains after visible mitigations, not a forecast of failure certainty.

[CR001, CR007, CR028, CR039, CR041, CR043]

7.2 Regulatory, legal, and provenance risk

The regulatory burden is meaningful because Code Metal is not selling an entertainment chatbot or a generic code helper. It is explicitly marketing trustworthy translation for critical systems, which puts it closer to the AI Act's high-risk logic and to government software-assurance expectations than to consumer AI norms. The EU framework now combines GPAI transparency, copyright-related obligations, and explicit high-risk controls for safety-relevant use cases. Even if Code Metal can avoid the harshest interpretation in some deployments, the administrative burden still rises as it sells into more sensitive workflows or publishes more model capability claims. The European Commission's FAQ also frames the AI Act as a uniform EU-wide regime whose high-risk categories and provider duties sit inside a living implementation process rather than a one-time compliance memo. That matters because Code Metal's mission-critical positioning can force customers to ask not only whether the product works, but whether disclosures, training-content summaries, labeling, and post-market governance will travel with it into regulated programs. The legal risk is less about a known active case and more about what the public record does not settle. The Copyright Office has made both training-data scrutiny and output copyrightability active policy topics: its January 2025 report says purely AI-generated material is not copyrightable absent sufficient human control, while a subsequent part is reserved for training, licensing, and liability questions. Retained sources still do not disclose the company's training-data provenance, licensed-data posture, or how any academic lineage translates into clean commercial IP ownership for generated code, proofs, or derivative artifacts. The privacy policy is also only a partial answer because it governs website data collection rather than the deeper customer-code, retention, and model-boundary issues enterprise buyers will ask about. The result is a classic diligence asymmetry: the public story is good enough to explain why investors funded the company, but not good enough to close questions on litigation clearance, provenance, or product-specific data handling without direct management materials.[CR008, CR009, CR010, CR011, CR012, CR013]

Regulatory / legal risk register
Rule / issue	Jurisdiction	Status	Likelihood	Severity	Mitigation maturity	Residual exposure	Diligence path
EU AI Act GPAI and high-risk obligations	European Union	Active; GPAI rules effective and transparency rules due August 2026	Medium-High	Critical	Medium	Documentation, copyright-summary, and labeling obligations can slow deployments into regulated or public-interest use cases	Request the compliance workplan, model governance owner, and any planned training-data summaries
Government software certification and assurance burden	United States defense / federal	Active programmatic burden rather than a one-time approval	High	High	Low-Medium	Mission deployments may take longer than commercial pilots because assurance evidence must survive procurement and review workflows	Map each named program to required accreditation, test evidence, and approval owners
Training-data provenance and copyright scrutiny	United States / global	Policy scrutiny is active; no company-specific resolution disclosed	Medium	High	Low	Any narrowing of acceptable training-data practice could increase licensing cost or retraining burden	Review training-data provenance, licenses, model cards, and indemnity language
Generated-output ownership and AI copyrightability	United States / global enterprise contracting	Active policy issue; output protection depends on human contribution and contract drafting	Medium	High	Low	Buyers may demand explicit ownership, review, and data-rights language before trusting generated mission code or proofs in production	Review customer contract terms for authorship, output ownership, data rights, and human sign-off requirements
Privacy and code-handling obligations	Global enterprise deployments	Public website privacy policy exists, but product-code handling detail is thin	Medium	Medium-High	Low-Medium	Buyers may demand more clarity on retention, segregation, and customer-code use than the website policy provides	Request the product DPA, retention schedule, and code-isolation architecture
Litigation, patent, and IP-license visibility gap	United States / global	No public clearance from retained sources	Medium	Medium-High	Low	Absence of docket or patent evidence is not the same as absence of exposure, especially with academic lineage and mission software	Run docket and patent searches and inspect enterprise contract indemnities

Severity ranks reflect underwriting impact, not legal certainty. The register emphasizes where public evidence stops short of clearing risk and where management materials would be decisive.

[CR008, CR009, CR010, CR011, CR012, CR013]

FR002: Risk transmission map — how trust and proof gaps flow into valuation risk

Directed graph showing how regulatory, technical, and disclosure risks compound into slower deployments, thinner revenue proof, and valuation pressure.

[CR008, CR011, CR024, CR026, CR028, CR041]

7.3 Operational, partner, and execution risk

Operationally, the core question is whether Code Metal's verification and portability claims scale economically outside curated examples. The product surface asks users to define target hardware, toolchains, and resource limits up front, while the research record itself shows both progress and boundaries: verified lifting works, but within explicit domains, and LLMs still struggle on complex reasoning about code performance. That combination is promising for hard use cases, yet it also implies that translation quality may still require substantial human review, per-target tuning, and deployment-specific support. The hiring record reinforces that concern. A Facility Security Officer, forward-deployed roles, and senior compiler or formal-methods positions fit a business that must win trust account by account and often program by program. That is compatible with early success, but it is not yet proof of a light-touch platform. If every major win needs heavy integration, security handling, and bespoke verification work, support burden can grow nearly as fast as revenue. GAO's 2026 review of federal AI acquisitions is a useful external check here: agencies reported difficulty accessing AI technical experts, understanding AI-related costs, and often buying AI as an ongoing service rather than a shrink-wrapped product. For a company selling mission software, that backdrop points toward longer evaluations and more continuing vendor involvement than ordinary developer-tool adoption. Partner and execution risk compound that burden. Growth is publicly tied to programs of record and a short list of marquee customers, while RTX sits on both the investor and customer side. National-security-oriented investors and academic lineage are advantages, but they also highlight how much the proof story still depends on a concentrated network of programs, people, and affiliated technical credibility. That same network is operating in a policy environment where GAO says DOD still needs department-wide AI acquisition guidance and where CISA argues software producers, including AI vendors, should bear the secure-by-design burden with secure defaults, logging, and lifecycle security ownership. Those are reasonable customer asks, but they can lengthen deployment work and increase the evidence load on Code Metal well beyond model quality alone.[CR018, CR019, CR020, CR021, CR022, CR023]

Operational / quality / security risk register
Failure mode	Likelihood	Severity	Mitigation maturity	Residual exposure	Unresolved gap
Formal-verification coverage fails to scale across large mixed-language or hardware-specific codebases	Medium-High	Critical	Medium	Bounded DSL and backend coverage could leave meaningful manual-review burden in real deployments	No public data on what percentage of customer code can be translated and proved without custom work
Generated code passes narrow tests but fails in production performance or edge cases	Medium	High	Low-Medium	Complex-kernel reasoning limits and missing field pass-rate data leave reliability risk material	No public benchmark links model output quality to production defect or rollback rates
Embedded or heterogeneous hardware deployments require extensive per-target configuration and tuning	High	High	Medium	Users must specify target hardware, toolchains, and resource limits before work begins	No public evidence quantifies how much of that setup is reusable versus bespoke
Forward-deployed delivery and support burden erodes software leverage	High	High	Low-Medium	Hiring mix implies sustained customer-specific implementation work and security handling	No public data on implementation hours, support ratios, or post-deployment maintenance load
Security or safety expectations outrun published operating evidence	Medium	High	Low	Mission-critical positioning raises expectations faster than public SLA or incident data has been shared	No public reliability, incident, or assurance-yield metrics are retained
Secure-by-design expectations exceed published lifecycle controls	Medium	High	Low	Public-sector buyers can reasonably ask for secure defaults, logging, vulnerability handling, and AI-lifecycle security evidence that is not visible in retained sources	No public SBOM, secure-default settings, vulnerability-response, or incident-preparedness materials are retained

This register focuses on whether a verifiable-code narrative survives operational reality at customer scale. Ratings are evidence-constrained and should be refreshed with customer quality metrics.

[CR015, CR016, CR017, CR018, CR019, CR020]

Partner / dependency risk register
Dependency	Counterparty / anchor	Role	Concentration	Failure scenario	Severity	Mitigation maturity	Residual exposure
Named defense programs and programs of record	U.S. Air Force, L3Harris, defense primes	Reference demand and procurement access	High	Evaluation cycles slip or contract paper sits behind primes, delaying conversion to repeat revenue	Critical	Low-Medium	Strong logos exist, but economic depth is still not independently visible
Investor-customer overlap	RTX / RTX Ventures	Signal, introductions, and validation	Medium-High	A strategic investor-customer overlap flatters proof quality without broadening the independent customer base	High	Low	Public evidence does not show how much revenue is tied to overlapped relationships
National-security investor network	J2 Ventures, Shield Capital, Overmatch, Salesforce Ventures	Capital, defense access, and signaling	Medium	If the next growth leg depends on the same network, diversification into broader commercial demand may lag	Medium-High	Medium	The network is an advantage until it becomes the main path to every large account
Hardware and toolchain pain thesis	CUDA migration, CPUs, GPUs, FPGAs, custom toolchains	Core value proposition driver	High	If buyers solve portability with internal tools or simpler rewrites, Code Metal loses urgency	High	Medium	The thesis depends on pain remaining large and frequent enough to sustain budget
Public procurement record visibility	USAspending and SAM search surfaces	Independent verification channel	Medium	Lack of directly attributable public record prevents clean validation of contract depth	Medium	Low	Management could clear this quickly, but public evidence does not do it today

Dependency here includes not just vendors but the external programs, records, and relationships the thesis relies on for validation. The cap table is strategically useful but narrows the path to proof.

[CR027, CR028, CR029, CR030, CR031, CR032]

People / execution risk register
Role / function	Dependency or gap	Likelihood	Severity	Mitigation maturity	Diligence path
Founder technical and commercial leadership	Morales and Showalter-Bucher remain central to mission, technical positioning, and defense credibility	Medium	High	Low-Medium	Test succession plans, customer-ownership mapping, and second-line technical leadership
Academic and scholar bench	Visible external lineage includes Loris D'Antoni and Berkeley-rooted verified-lifting research	Medium	High	Low	Determine what IP, code, and know-how are fully institutionalized inside the company
Operating bench below founders	Public visibility improves with Ryan Aytay, but board and leadership depth remain thin in the retained record	Medium	Medium-High	Low	Request org chart, board roster, and role owners across delivery, security, and government programs
Recruiting scarce specialist talent	Compiler, formal-methods, ASIC-verification, and forward-deployed roles are difficult to fill and expensive to retain	High	Medium-High	Low-Medium	Review attrition, offer acceptance, compensation pressure, and time-to-fill by critical role

Execution risk is less about generic startup hiring and more about whether a very specialized bench can scale without degrading delivery quality or proof quality.

[CR033, CR034, CR035, CR036, CR037, CR038]

FR003: Critical dependency map — people, programs, and proof channels

Dependency map showing the external programs, investors, and academic nodes whose cooperation or continuity matters to Code Metal's proof story.

[CR029, CR030, CR034, CR035, CR036, CR044]

7.4 Financial risk, mitigations, and thesis-breaks

Financially, the biggest risk is opacity at a price point that no longer tolerates much ambiguity. Public sources confirm repeated capital formation and a rapid move from a $250 million Series A to a $1.25 billion Series B valuation, but they do not disclose ARR, gross margin, burn, retention, or revenue mix. That leaves investors relying on a narrative of strong contracts and elite technical differentiation without being able to prove that the business already behaves like scalable software. The mitigations are real but incomplete. Formal-verification lineage, mission-critical positioning, and access to defense-oriented investors reduce go-to-market friction and help explain the funding pace. They do not, however, substitute for deployment-quality metrics, customer diversification, or a clear services-to-platform conversion. The burden of proof now sits on whether named reference accounts broaden and whether support effort declines as the installed base grows. The cleanest monitoring framework is simple: watch compliance friction, reliability evidence, customer breadth, delivery intensity, and the next financing or internal-mark signal. If trust work, services load, or procurement delay overwhelm the speed advantage, the investment case breaks before the technology story does.[CR039, CR040, CR041, CR042, CR043, CR044]

Financial / model risk register
Risk	Likelihood	Severity	Mitigation maturity	Residual exposure	Investment implication
Valuation moved ahead of public proof	High	Critical	Low	A $1.25 billion mark arrived before public ARR, margin, or retention disclosure	Entry price demands private diligence support rather than public-metric comfort
Runway not publicly disclosed despite large financing	High	High	Low-Medium	Repeated financing lowers immediate solvency risk but does not reveal cash efficiency	Investors need an internal model before assuming the Series B fully de-risks capital needs
Services-to-platform conversion remains unresolved	High	High	Low-Medium	Public signals still fit a bespoke deployment motion as much as a scaled software model	Gross-margin and valuation upside depend on productization, not just contract wins
Revenue quality is visible through contracts, not through recurring-software metrics	High	High	Low	Eight-figure contract language may mask one-time or labor-heavy economics	Underwriting should focus on mix, renewals, and implementation burden per dollar of revenue
Defense-style sales and acceptance cycles can lengthen cash conversion	Medium-High	Medium-High	Low-Medium	Program success can still create slow revenue recognition and expansion timing	Forecasts need conservative assumptions for conversion, acceptance, and budget timing

This table translates public opacity into underwriting risk. The issue is not whether demand exists, but whether that demand already behaves like compounding software revenue.

[CR005, CR006, CR007, CR039, CR040, CR041]

Mitigation and kill criteria table
Risk	Monitorable trigger	Threshold / event	Action implication
Compliance burden outruns delivery speed	Evidence of delayed launches or deal slippage tied to AI Act, certification, or security review	Two or more flagship deployments slip because trust or compliance work outweighs translation speed gains	Re-underwrite product fit in regulated deployments and demand a compliance roadmap before investing
Reliability claims remain unproven	Management cannot provide production pass-rate, rollback, or defect-escape data during diligence	No deployment-quality scorecard for named customer programs	Treat trust claims as marketing-led and reduce conviction on platform scalability
Customer concentration persists	Public customer roster remains effectively the same after the next financing cycle	No broadening beyond a small set of named defense or industrial logos	Assume revenue concentration and lower terminal multiple assumptions
Services burden dominates	Implementation or support effort per deployment stays high relative to contract value	Forward-deployed or solutions headcount grows as fast as engineering and revenue	Model slower margin expansion and question software-platform conversion
Financing resets before metrics catch up	Next financing or secondary signal prices the company materially below the implied Series B expectations	Down round, flat round, or internal mark pressure without public KPI improvement	Avoid price-taking behavior and revisit whether the thesis still works at public-evidence quality
Key-person or research loss	Founder, COO, or visible research affiliate departs without succession clarity	Departure plus no named technical or commercial successor within a quarter	Pause investment or require governance and retention protections

These are investor monitoring triggers rather than legal obligations. They are designed to surface when trust, delivery, and proof assumptions stop compounding into a scalable platform story.

[CR045, CR046, CR047, CR048, CR049, CR050]

Chapter 08

08Valuation

8.1 Financing context and public price support

Code Metal has real financing momentum, but the public support for the price is still incomplete. The strongest hard facts are straightforward: the company disclosed a $36.5 million Series A at a $250 million valuation in November 2025 and then a $125 million Series B at a $1.25 billion valuation in February 2026. That is a roughly fivefold valuation jump in only a few months, with about $177.95 million of visible capital across pre-seed, seed, Series A, and Series B. Named customers and mission-critical positioning make the jump intelligible, and the SEC Form D trail supports a fast financing cadence. The problem is not that the price is impossible; it is that public evidence does not yet show the operating bridge. Retained sources do not disclose ARR, gross margin, burn, cash, retention, customer count, or cap-table overhang terms. Public support therefore comes primarily from investor quality, customer logos, and narrative conviction rather than from the KPI package that would normally make a $1.25 billion private mark feel durable. That pushes the chapter toward scenario logic and entry discipline rather than heroic multiple math.[CV001, CV002, CV003, CV004, CV005, CV006]

Recommendation summary table
Lens	Current read	Evidence basis	Decision implication
Recommendation	research-more	Public evidence validates real product, customers, and financing, but not the KPI bridge behind a $1.25B mark.	Do not underwrite as a buy from public data alone.
Confidence	medium	Core financing facts are corroborated, while economics, concentration, and terms remain private.	Advance only with management data access.
Risk rating	high	Valuation opacity, services-mix risk, and concentration risk are still live.	Model material downside alongside upside.
Valuation stance	expensive	Today’s price already assumes platform-like outcomes not yet publicly proven.	Require either a lower entry price or stronger KPI proof.
Entry discipline	Strict	Ask for ARR, gross margin, customer concentration, backlog, and cap-table detail before moving beyond tracking.	No lead check at the current mark without that package.

This table summarizes the investment call, not a marketability score. The judgment is price-sensitive because the current valuation is public while operating metrics are not.

[CV031, CV032, CV033, CV034, CV035]

FV003: Valuation / return range

The evidence set supports wide scenario bands rather than point estimates because current revenue and margin are undisclosed.

All values are enterprise-value-style directional ranges in USD millions, anchored to milestones and directional comparable boundaries rather than direct revenue multiple math.

[CV025, CV026, CV027, CV028, CV029, CV043]

8.2 Investment thesis and anti-thesis

The positive thesis is real. Code Metal appears to occupy a meaningful wedge at the intersection of code modernization, hardware portability, and formal-verification-oriented trust. That wedge matters most in defense, aerospace, industrial, and semiconductor settings where failure costs are high and legacy code is hard to move. Public materials also support more than a pure concept story: the company has named reference accounts, technical depth, and repeated financing from both enterprise and defense-oriented investors. The anti-thesis is just as important. Public evidence still does not prove that verified translation is a budget-defining category rather than a premium feature inside broader modernization or security budgets. Competing routes include adjacent developer-security leaders, established assurance specialists, broad incumbents, and services-heavy federal integrators that already control budget and procurement pathways. If Code Metal cannot turn project-led migrations into repeatable software economics, then the right mental model is not a breakout developer platform but a narrower, more labor-intensive modernization vendor. That is why the bull story is understandable but not yet fully underwritten at the current price.[CV013, CV014, CV015, CV016, CV021, CV022]

Thesis / anti-thesis table
Dimension	Bull thesis	Anti-thesis	What would change the view
Market	Mission-critical modernization and verification can support a meaningful wedge inside larger software budgets.	The wedge may be too narrow or too services-heavy to support elite platform valuations.	Show repeatable expansion across several independent customers and sectors.
Product	Formal-verification-centric translation and hardware portability look differentiated versus generic copilots.	Differentiation may matter less than incumbent procurement reach or broader platform bundling.	Provide win-loss data where verification was the decisive buying criterion.
Customers	Named logos suggest relevance in high-stakes environments.	A short public logo list can hide concentration and investor-linked proof quality.	Disclose customer count, top-customer share, and pilot-to-production conversion.
Financials	Eight-figure contract language implies real demand at an early stage.	No public ARR, margin, burn, or revenue-mix data shows whether this is scalable software.	Open the KPI package and show software-led gross-margin progression.
Competition	Exact-match peers are limited, which can help a category leader emerge.	Adjacent platforms, incumbents, and services primes can absorb the same budget.	Show clear displacement against incumbents and internal-build alternatives.
Financing	Top-tier investors repeatedly funded the story at higher prices.	Fast round timing can also mean narrative outran public proof and raised future expectations.	Provide milestones tied to the next financing or cash-flow inflection.

Rows pair the strongest constructive read with the strongest evidence-backed counterargument. The right question is not whether the company is good, but whether the current price already discounts the good outcome.

[CV012, CV013, CV014, CV015, CV023, CV024]

FV001: Recommendation logic

The recommendation flows from a differentiated technical wedge and credible demand signals, but it is held back by KPI opacity and price already discounting a strong outcome.

This is a logic chain, not a quantified valuation model. It deliberately shows why the recommendation stops at research-more instead of buy.

[CV014, CV016, CV031, CV034, CV036, CV044]

FV004: Investment KPIs

IC-style scorecard of what looks strong versus what still blocks a clean investment call.

Scores are ordinal judgments out of 10 based only on retained public evidence as of the run date.

[CV013, CV014, CV016, CV032, CV034, CV036]

8.3 Scenario ranges and comparable boundaries

Because Code Metal does not disclose revenue, margin, or retention, comparables are best used as range boundaries instead of direct pricing formulas. Snyk and Sonar show what late-stage developer or code-quality platforms can command once investors can see hundreds of millions of revenue or explicit paths toward that scale. Diffblue offers a much smaller AI-for-code reference point, while IBM, Booz Allen, and SAIC matter less as valuation multiples than as reminders that large incumbents can absorb the same budget from different angles. GrammaTech and Galois show that trust and assurance have value, but they do not prove venture-scale software economics for Code Metal. The bull, base, and bear ranges therefore rest on milestones rather than faux certainty: audited growth and software leverage for the bull case, mixed-quality but real program momentum for the base case, and a repricing toward niche contractor-like outcomes for the bear case. Given the present disclosure mix, the base case deserves the highest weight, with real bear risk if metrics remain private into the next financing cycle.[CV017, CV018, CV019, CV020, CV021, CV022]

Bull / base / bear scenario table
Scenario	Core assumptions	Valuation range (USD B)	Probability signal	Downside or validation trigger
Bull	Management shows audited growth, software-led gross margins, diversified customers, and clear reuse of translation workflows.	1.5-2.4	Possible but not default; requires new KPI disclosure and proof of product leverage.	Validated by audited ARR, margin, and expansion data.
Base	Demand remains real, but economics still look mixed between product and high-touch delivery.	0.9-1.4	Most likely on current evidence because financing is clearer than monetization depth.	Holds if backlog grows but SaaS-style quality metrics remain only partly visible.
Bear	Market cools, services intensity remains high, or customer concentration and procurement friction block scaling.	0.4-0.8	Material risk because the public case still depends on narrative more than disclosed economics.	Triggered by weak KPI disclosure, flat growth, or a flat/down financing round.

These are milestone-based scenario ranges rather than direct revenue-multiple outputs. They are intentionally wide because public revenue and margin data are not available.

[CV025, CV026, CV027, CV028, CV029, CV043]

Comparable valuation table
Comparable	Metric	Multiple / valuation / status	Relevance	Limitation
Snyk	Developer security platform with $300M ARR and $278M 2024 revenue	Last disclosed private valuation $7.4B; still private and not rushing IPO	Useful upper-bound reference for a scaled developer-focused platform with visible revenue.	Broader security product, much later stage, and meaningfully more disclosed metrics than Code Metal.
Sonar	Code-quality platform with global reach	Raised $412M at $4.7B valuation in 2022; explicit ambition toward $1B revenue	Shows what a broader clean-code platform can command when scale is visible.	Not a mission-critical translation company; valuation is older and category breadth is wider.
Diffblue	AI-for-code testing specialist	Raised $6.3M in 2024 despite 326% net new ARR growth	Useful lower-scale private reference for narrow AI-for-code tooling.	Testing workflow is narrower and the round size says more about stage than about Code Metal’s exact economics.
IBM	Public modernization incumbent	Public enterprise with broad AI and modernization stack; not used as a direct multiple	Relevant because large accounts can satisfy similar needs through incumbent platforms.	Scale, installed base, and public-company profile make direct valuation comparison inappropriate.
Booz Allen / SAIC	Federal AI and mission-IT incumbents	Public services-heavy incumbents; directional budget competitors, not direct price comps	Important because they can win the same modernization budgets through existing procurement channels.	Services economics and federal contracting mix are very different from venture software expectations.
GrammaTech / Galois	High-assurance specialists	Private specialist references with long assurance track records but no public venture pricing markers	Relevant for trust credibility and substitution risk in regulated environments.	Lack of disclosed valuation data limits them to qualitative boundary markers.

This is a directional boundary set, not a clean comp basket. Each row is included for a specific lens—platform scale, AI-for-code narrowness, assurance credibility, or budget ownership.

[CV016, CV017, CV018, CV019, CV020, CV021]

FV002: Valuation sensitivity

Conviction is most sensitive to a short list of operating proofs that are still private.

Values are importance scores out of five rather than financial coefficients. Higher bars mean a driver has more influence on the underwriting range.

[CV018, CV025, CV030, CV035, CV039]

8.4 Recommendation, entry discipline, and final asks

The evidence-constrained answer is not to reject the company outright, but also not to underwrite the current valuation casually. Code Metal has enough technical differentiation, customer relevance, and financing support to justify serious diligence. What it does not yet have in public is the KPI transparency required for a buy call at $1.25 billion. The recommended posture is research-more with medium confidence, a high risk rating, and an expensive valuation stance. That means price sensitivity matters: either the next engagement needs a materially better entry point, or management needs to open a data room that proves recurring software revenue, margin structure, customer diversification, and cap-table economics. The key thesis-break condition is simple. If the company reaches the next financing milestone without showing that named demand is turning into repeatable software economics, then the current unicorn mark should be treated as fragile rather than self-validating. Final diligence should therefore stay focused on revenue quality, financing terms, customer concentration, and proof that verification-led delivery is becoming a scalable product rather than a bespoke service wrapper.[CV030, CV031, CV032, CV033, CV034, CV035]

Thesis-break and kill triggers table
Trigger	Threshold or event	Transmission to thesis	Action implication
Recurring software revenue still unproven	Management cannot show ARR, revenue mix, or gross margin progression before the next financing event.	Bull case collapses because product leverage remains hypothetical.	Stop progression or re-underwrite at a much lower valuation.
Customer concentration too high	One or two accounts drive an outsized share of revenue or backlog.	Named-logo proof becomes fragile and valuation durability weakens.	Treat as concentrated contractor risk, not platform risk.
Investor-linked proof dominates	A large share of marquee revenue comes from investor-adjacent customers such as RTX.	Reference quality weakens and independent demand looks thinner.	Require more independent customer proof before investing.
Services intensity remains high	Delivery still depends on heavy forward-deployed or bespoke engineering.	Gross-margin and scalability assumptions move toward technical-services outcomes.	Cut upside case and widen bear probability.
Procurement conversion stalls	Pilots or programs of record do not turn into growing funded deployments.	Go-to-market thesis weakens despite technical relevance.	Shift to track-only posture.
Next round is flat or defensive	Follow-on financing does not clear above the current mark or includes heavy structure.	Validates the concern that valuation outran evidence.	Use as a thesis-break or hard reset in price expectations.

These triggers are monitorable and map directly to the valuation story. They are designed to stop optimism from outrunning evidence.

[CV030, CV033, CV037, CV040, CV042]

Final diligence asks table
Topic	Missing evidence	Why it matters	Owner or diligence path
Revenue quality	ARR, recognized revenue, and revenue mix by software versus services	This is the missing bridge between customer logos and platform valuation.	Request CFO or board reporting package plus sample customer P&Ls.
Gross margin path	Current gross margin and margin split by product and services	Determines whether Code Metal scales like software or like bespoke delivery.	Review financial model and cohort economics with finance leadership.
Cash and burn	Current cash, quarterly burn, and runway assumptions	Shows whether the Series B bought enough time to prove repeatability before the next raise.	Request latest balance sheet, cash forecast, and board runway plan.
Cap table and preferences	Series A/B terms, liquidation preferences, secondary, debt, and pro rata rights	Headline valuation can overstate true return potential if structure is heavy.	Review financing docs and full cap table.
Customer concentration	Top-10 customer share and investor-linked revenue share	Validates the independence and durability of current demand proof.	Request cohort tables and customer concentration schedule.
Backlog and conversion	Funded backlog, renewal profile, and pilot-to-production conversion	Separates encouraging logos from durable monetization.	Request pipeline and backlog review by account.
Competitive displacement	Win-loss evidence versus IBM, Snyk, Sonar, services primes, and internal build	Sharpens both upside probability and moat durability.	Review procurement decks, anonymized loss notes, and references.
Proof of reuse	Evidence that later projects need less custom engineering than early ones	This is the crux of software leverage and margin expansion.	Review implementation timelines, reusable modules, and gross-margin cohorts by deployment vintage.

Every ask is tied to a specific underwriting uncertainty rather than generic curiosity. If management cannot answer these, the current valuation should remain a tracking item rather than an investment conviction.

[CV035, CV038, CV039, CV040, CV041]

8.5 Exhibits

Disclaimer

This report is for informational purposes only.

Evidence index

Claims
ID	Statement	Confidence	Sources
CO001	Code Metal publicly describes itself as providing verifiable code translation for mission-critical industries.	High	SO001, SO010
CO002	The product materials show translation and optimization workflows configured for specific CPUs, GPUs, FPGAs, toolchains, and resource constraints.	Medium	SO004
CO003	The retrieved About page says Code Metal is solving real problems with provable AI and is building a global team.	Medium	SO002
CO004	Public identity materials repeatedly place Code Metal in defense, automotive, semiconductor, industrial, and robotics contexts.	Medium	SO001, SO010
CO005	Code Metal was founded in 2023.	High	SO008, SO017
CO006	Boston is the strongest publicly supportable headquarters anchor for Code Metal.	High	SO010, SO018, SO021
CO007	Public hiring and investor materials imply a distributed footprint spanning Boston, San Francisco, remote roles, and possibly Washington, D.C.	Medium	SO003, SO014
CO008	Peter Morales is publicly identified as founder and CEO of Code Metal.	High	SO010, SO013, SO014
CO009	Alex Showalter-Bucher is publicly identified as a co-founder of Code Metal.	Medium	SO008, SO014
CO010	Founder background sources tie Morales and Showalter-Bucher to MIT Lincoln Laboratory and defense-system experience, including F-35-related work.	Medium	SO008, SO014
CO011	Ryan Aytay joined Code Metal in 2026 as President and COO after serving as CEO of Tableau.	High	SO010, SO013, SO018
CO012	Aytay's addition gives Code Metal a later-stage operating executive alongside its technical founders.	Medium	SO010, SO018
CO013	Retained public sources do not disclose a complete board roster, ownership structure, or investor-rights summary for Code Metal.	Medium	SO002, SO010, SO013
CO014	Key-person dependence remains material because public storytelling is concentrated around Morales and a small number of named leaders.	Medium	SO002, SO008, SO010, SO018
CO015	UCSD professor Loris D'Antoni publicly identifies himself as a Scholar at Code Metal.	Medium	SO024
CO016	Company research and independent technical materials frame Code Metal's differentiation around formal methods, verified lifting, and proof-backed code translation.	High	SO005, SO006, SO025, SO026
CO017	On 2024-07-23, Code Metal announced a $13 million seed round and disclosed a prior $3.45 million pre-seed round.	Medium	SO007
CO018	Shield Capital led the seed round and J2 Ventures led the pre-seed round.	Medium	SO007
CO019	Code Metal's Series A announcement stated that the company raised $36.5 million led by Accel at a $250 million valuation.	Medium	SO009
CO020	CNBC independently covered a November 2025 Accel-led financing for Code Metal at roughly $36 million, corroborating the timing while rounding down the amount.	Medium	SO016, SO021
CO021	SEC search results list Form D entries for Code Metal dated 2023-12-20, 2024-08-01, 2025-11-13, and 2026-03-12.	Medium	SO021
CO022	Code Metal's Series B announcement stated that the company raised $125 million at a $1.25 billion valuation led by Salesforce Ventures.	High	SO010, SO013, SO017, SO020
CO023	Public Series B materials name Accel, B Capital, Smith Point Capital, J2 Ventures, Shield Capital, Overmatch, and RTX as participants alongside Salesforce Ventures.	High	SO010, SO013, SO020
CO024	The Series A announcement named RTX Ventures, Bosch Ventures, Smith Point Capital, Overmatch VC, and AE Ventures as new investors, with Shield Capital and J2 Ventures also continuing.	Medium	SO009
CO025	Summing the disclosed pre-seed, seed, Series A, and Series B amounts yields about $177.95 million of publicly visible financing.	Medium	SO007, SO009, SO010
CO026	RTX is a publicly visible investor-customer overlap because RTX Ventures participated in the Series A and RTX was later named as a customer.	High	SO009, SO010, SO013, SO017
CO027	Code Metal's disclosed investor base mixes defense-oriented capital with enterprise-software investors.	Medium	SO007, SO010, SO014, SO015, SO022, SO023
CO028	By February 2026, public company-linked materials named Toshiba, RTX, L3Harris, and the U.S. Air Force as customers using Code Metal.	High	SO010, SO013, SO017
CO029	Wired also described Toshiba work and early customers including RTX, L3Harris, and the U.S. Air Force.	Medium	SO017
CO030	Series A materials claimed Code Metal was already on contract to deliver eight figures in revenue that year.	Medium	SO009
CO031	The retained public record does not disclose audited revenue, ARR, customer count, debt, or secondary-sale details for Code Metal.	Medium	SO009, SO010, SO013, SO021
CO032	The July 2024 launch article said Code Metal employed seven people at that stage.	Medium	SO008
CO033	The current careers page lists at least 17 named openings across engineering, research, operations, finance, and solutions.	Medium	SO003
CO034	Current hiring signals show Boston, San Francisco, and remote roles, while a Salesforce Ventures page also lists Washington, D.C. in the company's location footprint.	Medium	SO003, SO014
CO035	Code Metal hosted the Metal Ops hackathon in Boston from March 14 to March 16, 2025 around USSOCOM-related smart-city concepts.	Medium	SO012
CO036	Code Metal maintains a landing page saying Forbes covered the company, but the retrieved text does not expose a publication date or underlying article URL.	Medium	SO011
CO037	The company's current valuation and growth narrative relies heavily on company announcements, investor theses, and press pickup rather than audited operating disclosure.	Medium	SO009, SO010, SO013, SO014, SO015, SO017, SO020
CO038	Wired explicitly warned that methodologies behind AI code startups remain unproven and that investors are willing to gamble on a few eventual winners.	Medium	SO017
CO039	Investor-customer overlap and strategic corporate backers may improve access but also weaken the independence of some commercial proof points.	Medium	SO009, SO010, SO013, SO017
CO040	Exact current headcount is not publicly verified; the retained evidence only supports an older seven-person snapshot plus current hiring activity.	Medium	SO003, SO008
CO041	The retrieved About and Careers pages do not provide a complete named executive or technical-lead roster.	Medium	SO002, SO003
CO042	Official materials say customers use Code Metal to move between programming languages and optimize software for hardware at high speed.	High	SO010, SO013
CO043	B Capital's investment thesis says Code Metal's hybrid formal-methods and LLM approach is reliable enough for defense and industrial customers and trusted in production.	Medium	SO015
CO044	Salesforce Ventures wrote that Code Metal secured customers including L3Harris, Raytheon, and the U.S. Air Force in its first year of commercialization.	Medium	SO014
CO045	Across official and technical materials, the public technical narrative is anchored in making AI-generated code trustworthy through verified translation rather than through testing alone.	High	SO001, SO005, SO006, SO025
CM001	Code Metal’s product flow asks users to specify target CPUs, GPUs, FPGAs, toolchains, and runtime resource limits before translation work begins.	Medium	SM001
CM002	Code Metal’s formal-methods essay argues that testing alone is insufficient for safety- and mission-critical systems because passing tests does not prove behavior across all possible inputs.	Medium	SM002
CM003	Current company and investor materials consistently place Code Metal in defense, automotive, semiconductor, industrial, aerospace, and other mission-critical environments rather than in general consumer software.	Medium	SM009, SM010, SM011
CM004	Code Metal’s NVIDIA-portability research frames the problem as moving existing CUDA workloads onto other hardware architectures without losing performance or correctness.	Medium	SM003
CM005	Code Metal’s legacy-migration research describes translating Java, C, and C++ programs into target languages or DSLs so older code can run on newer hardware platforms.	Medium	SM005, SM006
CM006	DARPA’s HACMS program describes a high-assurance market spanning networked embedded systems such as SCADA, medical devices, communication devices, airplanes, and satellites.	Medium	SM013
CM007	The relevant category is therefore AI-driven modernization, translation, and verification for legacy or hardware-coupled mission software, not generic code generation.	Medium	SM001, SM002, SM003, SM013
CM008	IBM’s AI coding agent and similar enterprise coding tools position themselves around SDLC productivity, planning, execution, and governance, showing that generic coding agents are adjacent but broader than Code Metal’s wedge.	Medium	SM024
CM009	Adjacent assurance vendors such as Sonar, Snyk, Diffblue, GrammaTech, and Galois already sell code verification, security, testing, or software-assurance workflows into enterprise and high-stakes software teams.	Medium	SM021, SM022, SM023, SM028, SM030
CM010	The market boundary excludes pure greenfield copilot spend when hardware portability, formal verification, or certification evidence are not part of the buying trigger.	Medium	SM001, SM002, SM008, SM024
CM011	Grand View Research estimates the global application modernization services market at USD 17.8 billion in 2023 and USD 52.46 billion by 2030.	Medium	SM019
CM012	Grand View attributes modernization demand partly to the maintenance burden, support cost, and infrastructure complexity of legacy applications.	Medium	SM019
CM013	Mordor Intelligence estimates the application security market at USD 14.83 billion in 2026 and USD 28.11 billion by 2031.	Medium	SM020
CM014	Mordor says code scanning is increasingly embedded at every commit and across development, staging, and production environments, indicating that assurance tooling is already a standing budget category.	Medium	SM020
CM015	The DoD Software Modernization Implementation Plan says the department must maintain a competitive edge in an increasingly software-defined battlespace through faster and more resilient software delivery.	Medium	SM017
CM016	DoDI 5000.87 requires modern iterative software methods, encourages DevSecOps, and expects minimum viable capability releases within one year of initial funding with at least annual subsequent releases.	Medium	SM018
CM017	Retained public sources do not isolate a standalone published TAM for verified AI code modernization in defense, semiconductor, aerospace, and industrial edge software.	Medium	SM017, SM018, SM019, SM020
CM018	The most defensible sizing method is therefore a wedge across modernization budgets, assurance budgets, and mission-critical software programs rather than a single top-down category number.	Medium	SM017, SM018, SM019, SM020
CM019	A plausible near-term serviceable market for Code Metal-like platforms is roughly USD 0.6-1.8 billion of annual spend across defense primes, government software programs, semiconductor platform teams, and industrial or aerospace OEM software groups.	Low	SM011, SM017, SM018, SM019, SM020
CM020	A broader USD 1.8-4.5 billion TAM becomes plausible only if buyers fund both migration and ongoing verification across multiple architectures and regulated workflows instead of treating projects as one-off services.	Low	SM010, SM011, SM019, SM020
CM021	Those ranges should be treated as evidence-constrained bounds because public pricing, contract-conversion rates, and deployment-volume disclosures are thin.	Medium	SM017, SM018, SM019, SM020
CM022	In defense-prime settings, the most likely users are embedded, mission, toolchain, and verification engineers working inside long-lived program codebases.	Medium	SM001, SM010, SM011, SM026
CM023	Government program managers and sponsors materially influence payer decisions because official DoD software pathways tie releases, operational acceptance, and program execution to those roles.	Medium	SM017, SM018
CM024	Semiconductor and platform teams are logical buyers because Code Metal’s public materials emphasize heterogeneous hardware retargeting, toolchain integration, and moving workloads away from CUDA or architecture-specific implementations.	Medium	SM001, SM003, SM010
CM025	Industrial, aerospace, and robotics software teams fit the market because the relevant systems are embedded, hardware-coupled, and high consequence, matching the sectors highlighted by HACMS, Galois, GrammaTech, and Code Metal.	Medium	SM009, SM013, SM022, SM023
CM026	Systems integrators and government IT contractors are part of the buying path because Booz Allen and SAIC publicly market AI and modernization capabilities to national-security customers.	Medium	SM026, SM027
CM027	The likely commercial motion is enterprise or program funding rather than low-price seat expansion because deployments touch codebases, hardware constraints, build systems, and assurance workflows.	Medium	SM001, SM017, SM018, SM024
CM028	Adoption is likely to start with a bounded pilot or minimum viable capability release and expand only after integration, trust evidence, and operational acceptance are achieved.	Medium	SM017, SM018
CM029	Investor theses argue that manual rewrites in decades-old defense, aerospace, and semiconductor systems are slow, risky, and unscalable, which explains willingness to buy platform-level modernization tools.	Medium	SM010, SM011
CM030	Budget ownership is likely to sit with engineering, platform, modernization, or program leaders rather than with individual developers.	Medium	SM017, SM018, SM026, SM027
CM031	Status-quo alternatives include manual rewrites, specialist kernel or compiler teams, internal build-and-verify flows, and point assurance tools such as testing, security scanning, and software-analysis products.	Medium	SM003, SM021, SM022, SM028, SM030
CM032	CISA and NSA say organizations supporting national security systems and critical infrastructure should plan for memory-safe languages and more secure development approaches.	Medium	SM015, SM016
CM033	NIST is extending its AI risk-management work toward trustworthy AI in critical infrastructure, reinforcing that governance and trust requirements will accompany AI-enabled software adoption.	Medium	SM014
CM034	Code Metal’s own MATLAB-to-HDL translation research shows that even relatively narrow code-translation tasks benefit from carefully structured workflows and still require repair loops.	Medium	SM007
CM035	UniPar reports only 69 percent compilation success and 33 percent functional correctness after tuning for parallel-code translation, indicating that automated translation at scale still has material failure rates.	Medium	SM008
CM036	DARPA ARCOS says current DoD certification practices are antiquated and unable to scale because they rely on human evaluators and poorly decomposed assurance evidence.	Medium	SM012
CM037	Defense software adoption can be slowed by procurement and acceptance gates even when agile methods are encouraged, because releases still move through formal program structures and operational acceptance steps.	Medium	SM017, SM018
CM038	Large legacy repositories, bespoke toolchains, and heterogeneous hardware stacks make integration difficult enough that onboarding risk can remain services-heavy.	Medium	SM001, SM003, SM005, SM006
CM039	Code Metal’s Series B announcement says new capital will add engineering capacity, expand government and commercial partnerships, and scale go-to-market, implying that category development still needs meaningful field building.	Medium	SM009
CM040	The bull case is that portability plus verification plus mission-specific translation unlocks budgets that generic copilots cannot reach.	Medium	SM002, SM010, SM011
CM041	The bear case is that verification may not scale economically or independently enough across large mixed-language codebases and certification packages to support broad deployment.	Medium	SM008, SM012, SM025
CM042	Independent proof of broad deployment economics is still thinner than the category narrative because the strongest direct claims about trust, speed, and customer impact come mainly from company and investor materials.	Medium	SM009, SM010, SM011
CM043	The strongest growth drivers are hardware churn, legacy code debt, memory-safety pressure, secure-by-design requirements, and AI adoption inside regulated engineering organizations.	Medium	SM003, SM015, SM016, SM017, SM019
CM044	The strongest adoption constraints are long procurement cycles, certification bottlenecks, integration difficulty, missing pricing and conversion data, and limited public proof on scaled verification outcomes.	Medium	SM012, SM017, SM018, SM021, SM025
CM045	The central diligence question is whether a Code Metal-like platform can convert clear technical pain into repeatable product revenue rather than services-heavy bespoke engagements.	Medium	SM009, SM017, SM018, SM021
CM046	The sizing model should be read as a wedge from broad adjacent markets into a narrower mission-critical slice, not as a clean published industry taxonomy.	Medium	SM019, SM020
CM047	The highest-propensity early segments are organizations that face both hardware-portability pain and non-optional trust or assurance requirements.	Medium	SM001, SM002, SM003, SM012
CM048	Procurement flow is itself part of the product challenge because deployment requires integration, security evidence, and operational acceptance in addition to generated code.	Medium	SM017, SM018
CP001	Code Metal publicly describes itself as providing verifiable code translation for mission-critical industries.	High	SP001, SP007
CP002	Code Metal's product page says customers can choose target CPUs and accelerators including GPUs and FPGAs, indicating hardware portability beyond pure source-to-source translation.	High	SP002, SP006
CP003	Code Metal's research materials frame its approach as combining LLM-based translation with formal-verification-oriented validation rather than relying only on testing.	High	SP003, SP004, SP005
CP004	Company and investor materials place Code Metal in defense, semiconductor, automotive, robotics, and other mission-critical modernization workloads.	High	SP001, SP008, SP009
CP005	DARPA says current DoD software certification practices are antiquated and do not scale, supporting demand for tools that can automate high-assurance evidence generation.	High	SP010, SP011
CP006	NIST is extending AI risk-management guidance toward trustworthy AI in critical infrastructure, reinforcing that governance and assurance matter in regulated deployments.	High	SP012, SP013
CP007	GrammaTech markets software-assurance and cyber-security solutions and says it brings more than 30 years of cyber innovation.	High	SP014, SP015
CP008	GrammaTech's public positioning and Learn hub emphasize software analysis, cybersecurity, software assurance, vulnerability discovery, research, and services rather than AI-led hardware-portable code translation.	Medium	SP014, SP015, SP030
CP009	Galois markets high-assurance solutions and tools across aerospace and defense, healthcare, automotive, semiconductors, and fintech.	Medium	SP016
CP010	Relative to Code Metal's recent launch, Galois represents a more established high-assurance peer with a longer public track record.	Medium	SP016, SP028
CP011	Diffblue describes itself as an AI testing agent for enterprise unit testing, and its About page says the platform can autonomously write a unit test every 2 seconds.	High	SP017, SP018, SP029
CP012	Diffblue Cover and Diffblue's company materials center on automated unit-test generation and maintenance, which is narrower than Code Metal's translation-and-portability proposition.	Medium	SP017, SP018, SP029
CP013	Snyk Code is a developer-focused SAST product that finds, prioritizes, and auto-fixes unsafe code rather than translating legacy workloads across hardware targets.	High	SP019, SP020
CP014	Sonar says its platform is trusted by more than 7 million developers and 75% of the Fortune 100, and SonarQube is marketed as code verification for the AI era, giving Sonar a much larger publicly visible installed base than Code Metal.	Medium	SP021, SP031
CP015	IBM's AI coding agent page positions the product around building faster, modernizing legacy systems, and coordinating planning, execution, and verification across the SDLC; the watsonx Code Assistant URL resolves to the same enterprise coding-agent offer.	Medium	SP022, SP032
CP016	IBM watsonx is positioned as a governed enterprise AI stack that integrates with existing tools and infrastructure, expanding IBM's credibility beyond a point coding tool.	High	SP022, SP023
CP017	Booz Allen says it is the number one provider of AI solutions to the federal government and frames responsible AI as a core capability.	Medium	SP024
CP018	SAIC publicly markets data and artificial-intelligence services inside its mission IT portfolio, making it a services-led substitute for agencies choosing integrators over standalone software.	Low	SP025
CP019	Most listed competitors do not publicly offer Code Metal's exact combination of code translation, heterogeneous hardware targeting, and formal-verification-centric messaging.	Medium	SP001, SP002, SP003, SP014, SP016, SP017, SP019, SP021, SP022
CP020	Several competitors nonetheless bring stronger installed bases or institutional relationships than Code Metal, including Sonar's developer footprint, IBM's enterprise reach, and Booz Allen's federal AI relationships.	Medium	SP021, SP022, SP024, SP028
CP021	GrammaTech and Galois are the closest named peers on assurance credibility, but their public pages emphasize assurance solutions and research services more than AI-driven portability-focused code translation.	Medium	SP014, SP015, SP016, SP003
CP022	Diffblue, Snyk, and Sonar compete for adjacent testing, security, and code-quality budgets rather than the full modernization-and-portability job that Code Metal targets.	Medium	SP017, SP019, SP020, SP021
CP023	IBM is better understood as a broader modernization incumbent: it can combine AI coding assistance with watsonx governance and existing enterprise-tool relationships even though its public materials do not emphasize formal verification.	Medium	SP022, SP023
CP024	Booz Allen and SAIC are substitutes when defense buyers prefer services engagements or contractor-led modernization rather than adopting a new product vendor.	Medium	SP024, SP025
CP025	Large internal engineering teams can attempt internal builds using open-source or academic verified-program tooling plus general LLMs, making build-versus-buy a credible substitute.	Medium	SP005, SP026
CP026	Internal builds reduce vendor lock-in but shift integration, proof, and maintenance burden back onto the buyer.	Medium	SP010, SP012, SP026
CP027	Code Metal's likely budget owner is an engineering, modernization, or mission-software program team that cares about runtime migration and proof of correctness, not just developer convenience.	Medium	SP001, SP002, SP007, SP010
CP028	Snyk, Sonar, and Diffblue are more likely purchased from AppSec, DevSecOps, QA, or developer-productivity budgets than from the same modernization bucket Code Metal targets.	Medium	SP017, SP019, SP021
CP029	IBM, Booz Allen, and SAIC are positioned to sell through CIO, enterprise-platform, or program-office modernization budgets that already exist inside large institutions.	Medium	SP022, SP023, SP024, SP025
CP030	Public pricing is sparse across this landscape because most retained official pages describe capability, governance, or services posture rather than complete contract pricing.	Medium	SP014, SP016, SP017, SP019, SP021, SP022, SP024, SP025
CP031	Even when Code Metal wins on verification specificity, incumbents can compress competition through bundle economics or embedded procurement channels rather than like-for-like feature superiority.	Medium	SP021, SP022, SP023, SP024, SP025
CP032	Sonar's self-reported reach across 7 million developers and 75% of the Fortune 100 indicates a distribution moat that Code Metal cannot yet match publicly.	Medium	SP021
CP033	IBM's modernization pitch and watsonx governance stack suggest a deployment model centered on enterprise platform integration, whereas Code Metal emphasizes targeted translation into selected runtime environments.	Medium	SP002, SP022, SP023
CP034	If adopted deeply, Code Metal's switching costs likely come from translated code assets, target-hardware configuration knowledge, and verification workflows tied to specific modernization programs.	Medium	SP002, SP003, SP005, SP006
CP035	Those switching costs may begin lower than classic seat-licensed enterprise software because many buyers will first use Code Metal on project-specific migrations instead of company-wide standardization.	Low	SP002, SP007, SP028
CP036	Mission-critical deployments are likely to face heavier assurance and governance review than generic coding assistants because defense and critical-infrastructure programs tie software changes to certification and risk-management processes.	High	SP010, SP011, SP012
CP037	Some peers have longer formal-methods or assurance track records than Code Metal, which can matter in regulated procurement even if they lack Code Metal's exact feature mix.	Medium	SP014, SP015, SP016, SP028
CP038	Code Metal's moat durability depends on maintaining a visible verification advantage as adjacent vendors market code verification, testing, or governed AI as standard features.	Medium	SP003, SP005, SP021, SP022
CP039	Defense-services substitutes can beat Code Metal without matching the product because agencies may prioritize incumbent relationships and procurement convenience over adopting a new standalone workflow.	Low	SP024, SP025, SP028
CP040	Public evidence on realized pricing, contract values, and named competitive displacement win rates remains thin, so pricing comparisons should be treated as directional rather than conclusive.	Low	SP017, SP019, SP021, SP022, SP025
CP041	DARPA's ARCOS and HACMS programs show that defense software buyers value certifiable, high-assurance outcomes, a posture more aligned with Code Metal, Galois, and GrammaTech than with generic copilots.	High	SP010, SP011, SP014, SP016
CP042	Independent academic work documenting security weaknesses in Copilot-generated code strengthens the case for verification-oriented differentiation versus generic AI coding tools.	Medium	SP003, SP005, SP027
CI001	Code Metal publicly positions itself as a provider of verifiable code translation for mission-critical industries rather than a generic coding copilot.	Medium	SI001
CI002	The product page shows configuration around CPUs, GPUs, FPGAs, toolchains, and resource limits, making hardware portability a core part of the commercial use case.	Medium	SI002
CI003	Code Metal's research argues that testing alone is insufficient for safety- and mission-critical software and that stronger guarantees require formal methods.	Medium	SI004
CI004	The LLMLift research summary says Code Metal combines LLM translation with proof generation to establish functional equivalence across target languages and DSLs.	Medium	SI005
CI005	Code Metal's legacy-code migration post says LLMLift translates Java, C, and C++ into target languages for newer hardware while verifying the outputs.	Medium	SI007
CI006	Code Metal's NVIDIA portability post says conventional migration can require scarce target-architecture experts plus weeks of porting and tuning per kernel.	Medium	SI006
CI007	Code Metal maintains an active public research pipeline spanning verification, HPC translation, and domain-specific models, implying ongoing R&D investment beyond marketing copy.	Medium	SI022, SI023, SI024
CI008	The July 2024 seed announcement said Code Metal would expand experts in hardware, AI, and compiler design to serve a rapidly growing customer pipeline.	Medium	SI025
CI009	The Series A announcement said Code Metal had already deployed its platform across defense, automotive, and semiconductor industries.	High	SI009, SI014
CI010	The Series A announcement said Code Metal was already on contract to deliver eight figures in revenue that year.	High	SI009, SI014
CI011	Code Metal's November 2025 Series A was publicly reported as $36.5 million at a $250 million valuation.	High	SI009, SI014
CI012	Code Metal's February 2026 Series B was publicly disclosed as $125 million led by Salesforce Ventures with participation from Accel, B Capital, Smith Point, J2, Shield, Overmatch, RTX, and others.	High	SI010, SI011, SI026, SI027, SI028, SI029, SI030, SI031
CI013	The Series B use-of-funds language prioritized engineering capacity, product development, commercial and government partnerships, and go-to-market scale-up.	High	SI010, SI011, SI016, SI027, SI028
CI014	By February 2026, public official and independent coverage named Toshiba, RTX, L3Harris, and the U.S. Air Force as customers.	High	SI010, SI011, SI015
CI015	Salesforce Ventures said demand had already pulled Code Metal into programs of record across the U.S. Air Force, L3Harris, and more.	High	SI010, SI011, SI012
CI016	The public named-customer set spans enterprise or industrial accounts and government or defense programs rather than self-serve developer users.	Medium	SI010, SI011, SI012, SI015
CI017	Neither the homepage nor the product page publishes list pricing, packaged tiers, or self-serve checkout.	High	SI001, SI002
CI018	Retained public sources disclose no realized contract values, no discount schedules, and no revenue-recognition detail for Code Metal deals.	Medium	SI009, SI010, SI011, SI014
CI019	The careers page lists a Vice President of Finance, Facility Security Officer, Forward Deployed Engineer, Principal Solutions Architect, and multiple senior technical roles.	Medium	SI003
CI020	The mix of forward-deployed, solutions, and platform roles suggests customer onboarding and delivery still require hands-on implementation support.	Medium	SI003
CI021	The mix of formal-methods, compiler, ASIC verification, modeling, and HPC research signals a research-heavy labor base rather than a light-touch SaaS support model.	Medium	SI003, SI004, SI005, SI022, SI023
CI022	The seed and Series B materials both emphasize customer pipeline growth and partnerships instead of standardized public packaging, which is consistent with enterprise and government selling.	Medium	SI025, SI010, SI011
CI023	Salesforce Ventures frames the commercial problem as legacy code tied to old hardware plus a shortage of engineers fluent across legacy and modern stacks.	Medium	SI012
CI024	B Capital argues that defense and aerospace buyers need verified code because hallucinations, unchecked edge cases, or memory bugs can threaten national security and human life.	Medium	SI013
CI025	DARPA ARCOS says DoD software certification is antiquated, human-heavy, and hard to scale, which increases the value of automated assurance evidence.	Medium	SI019
CI026	DARPA HACMS says high-assurance cyber-physical systems benefit from formal-methods-based synthesis and machine-checkable proofs.	Medium	SI020
CI027	NIST's AI Risk Management Framework and its 2026 critical-infrastructure profile reinforce a trust-and-governance backdrop for verified AI in critical systems.	Medium	SI021
CI028	Public evidence supports a contract-led demand story, but not disclosed recurring-software revenue quality.	Medium	SI009, SI010, SI011, SI014
CI029	No retained public source discloses ARR, gross margin, burn, cash on hand, net retention, or total customer count.	Medium	SI009, SI010, SI011, SI014, SI015
CI030	Code Metal's portability value proposition is economic as well as technical because manual migration away from legacy or Nvidia-tied stacks is presented as slow and expert-constrained.	Medium	SI006, SI012, SI013
CI031	Code Metal's work on LLMLift, UniPar, and MonoCoder suggests reusable software or model leverage could emerge if current delivery standardizes into repeatable workflows.	Medium	SI005, SI022, SI023, SI024
CI032	Public materials do not disclose how much revenue comes from platform subscriptions versus bespoke translation projects or professional services.	Medium	SI001, SI002, SI009, SI010
CI033	The seed, Series A, and Series B disclosures together imply about $177.95 million of publicly visible financing.	Medium	SI025, SI009, SI010, SI011
CI034	The SEC EDGAR results page shows Code Metal Form D notices in 2023, 2024, twice in 2025, and 2026.	Medium	SI018
CI035	The move from the November 2025 Series A to the February 2026 Series B compressed the interval between major rounds to only a few months.	High	SI009, SI010, SI011, SI014, SI018, SI029, SI030
CI036	The $125 million Series B likely materially strengthened near-term runway, but capital adequacy still depends on undisclosed burn and on whether delivery remains services-heavy.	Medium	SI003, SI010, SI011, SI016
CI037	The investor base combines enterprise-software capital with defense and strategic investors, which can improve introductions and procurement credibility.	Medium	SI010, SI011, SI012, SI013, SI025
CI038	RTX is publicly named as both an investor and a customer, creating a visible overlap between commercial proof and strategic-capital support.	Medium	SI010, SI011, SI013
CI039	Wired said some methodologies behind AI code-tooling startups remain unproven and that investors are gambling that at least a few will work.	Medium	SI015
CI040	Wired and MassRobotics both reported that Code Metal pointed to work with a large chip company without naming the prospect, so some demand narration outruns disclosed contract evidence.	Medium	SI015, SI017
CI041	Public customer proof still rests on a short disclosed logo list plus company-described programs of record, leaving concentration and durability unresolved.	Medium	SI010, SI011, SI012, SI015, SI017
CI042	No retained public source quantifies government award dollars, funded backlog, or contract win rates for Code Metal.	Medium	SI010, SI011, SI012, SI018
CI043	The likely sales motion is long-cycle and high-touch because buyers operate in defense, semiconductor, automotive, and critical-infrastructure environments with procurement and validation friction.	Medium	SI010, SI011, SI012, SI013, SI019, SI020, SI021
CI044	Financial underwriting remains evidence-constrained because contract existence and fundraising momentum are visible while repeatable recurring revenue, margin path, and cash burn remain opaque.	Medium	SI010, SI011, SI014, SI015, SI018
CI045	The best public financial case is a company with credible early contracts, unusually fast capital access, and a differentiated verification thesis whose economics could improve if deployment becomes more reusable software over time.	Medium	SI005, SI010, SI011, SI015, SI022, SI023
CE001	Code Metal publicly describes itself as verifiable code translation for mission-critical industries.	Medium	SE001, SE014
CE002	The homepage says Code Metal unites high-level reasoning with low-level verification to produce tested optimized and compliant code.	Medium	SE001
CE003	The product workflow starts by loading high-level reference code written in Python Matlab or Julia through a Code Metal IDE plugin.	Medium	SE002
CE004	The intake layer is described as automatically tracking complex module and library dependencies while identifying low-level language equivalents.	Medium	SE002
CE005	Users are asked to define a target runtime by choosing CPU architecture accelerator mix resource constraints and preferred toolchains before generation begins.	Medium	SE002
CE006	Named runtime examples include x86 or ARM/RISC CPUs NVIDIA AMD and Qualcomm GPUs multiple FPGA families and toolchains such as ONNX and Vivado.	Medium	SE002
CE007	Code Metal says its agentic workflow generates a transpilation and deployment plan for the selected edge environment.	Medium	SE002, SE016
CE008	The product page says generated outputs can include embedded C/C++ or Rust for CPUs synthesizable VHDL or Verilog for FPGAs and CUDA HIP or ONNX-oriented GPU code.	Medium	SE002
CE009	The public workflow promises alternative generated variants optimized for memory runtime performance code size or power consumption.	Medium	SE002
CE010	Code Metal says it tracks changes between generated and input code integrates with standard IDEs and versioning tools and can suggest matching upstream annotations after manual edits.	Medium	SE002
CE011	The homepage also markets deploy-to-any-chip portability making hardware retargeting a first-order product value rather than a side feature.	Medium	SE001, SE015
CE012	Public about and careers pages show a team identity anchored in formal methods compiler design ASIC verification compiler and AI tooling DevOps solutions architecture and forward deployed engineering.	Medium	SE003, SE004
CE013	The July 2024 seed announcement says Code Metal is building modular and verifiable agentic workflows for edge development.	Medium	SE016
CE014	The seed announcement also says the developer platform integrates traditional formal-methods-based code analysis with advanced custom coding language models.	Medium	SE016
CE015	Series A messaging distinguishes the product from generic vibe coding by promising zero-error production code onto hardware.	Medium	SE015, SE028
CE016	Official Series B and Salesforce Ventures materials describe the product as neuro-symbolic and say it mathematically proves code is correct rather than only predicting code tokens.	Medium	SE014, SE018
CE017	Official and investor materials repeatedly position Code Metal in defense aerospace semiconductor automotive and other regulated or mission-critical environments.	Medium	SE014, SE018, SE019
CE018	Series B materials say customers use the product to move between programming languages and optimize software for hardware.	Medium	SE014, SE017
CE019	Named public customer references include Toshiba RTX L3Harris and the U.S. Air Force.	Medium	SE014, SE017
CE020	The Series B announcement says new capital will add engineering capacity accelerate product development expand commercial and government partnerships and scale go-to-market capabilities.	Medium	SE014, SE017
CE021	Code Metal's formal-methods explainer argues that testing only shows behavior on tried inputs and cannot establish the absence of bugs.	Medium	SE006
CE022	The same explainer defines formal methods as reasoning over every execution permitted by a system against a specification.	Medium	SE006
CE023	The explainer also says formal methods historically struggled to scale because machine-checked proofs were expensive and required deep expertise.	Medium	SE006
CE024	Code Metal frames its own trust problem as proving behavior preservation when translating CUDA to OpenCL M files to VHDL or legacy C++ to Rust.	Medium	SE006
CE025	LLMLift is described as combining LLM-driven translation with proof generation to produce functionally equivalent transpilation for target DSLs.	Medium	SE007, SE023
CE026	The arXiv HTML abstract says prior verified-lifting tools were specialized to narrow source-target pairs or required significant domain expertise to make search efficient.	Medium	SE023
CE027	Code Metal's LLMLift migration note says Tenspiler exposed a scaling problem because transpilation time rose with input and output complexity under enumerative search.	Medium	SE008
CE028	The same note says encoding optimizations in Tenspiler required roughly 1200 lines of manual logic plus considerable target-domain knowledge.	Medium	SE008
CE029	Tenspiler publicly claims support for six DSLs across a broad range of software and hardware environments through a TensIR intermediate representation.	Medium	SE024
CE030	Metalift publicly exposes a Python API over LLVM analysis with Rosette and the CVC5 theorem prover as core synthesis and verification substrate.	Medium	SE025
CE031	Alvin Cheung's public research summary positions verified lifting as a formal-methods-plus-deep-learning line of work spanning compilers and data-processing systems.	Medium	SE026
CE032	Loris D'Antoni's public page links Code Metal's scholarly orbit to specification-aligned LLMs and compiler synthesis research aimed at trusted software.	Medium	SE027
CE033	UniPar evaluates serial CUDA and OpenMP translation using fine-tuning hyperparameter tuning and compiler-guided repair rather than raw prompting alone.	Medium	SE011
CE034	gpuFLOPBench evaluates LLM reasoning on 577 CUDA kernels and finds materially weaker performance on division math functions and shared subexpressions than on straightforward kernels.	Medium	SE012
CE035	The workflows-versus-agents paper compares structured and agentic syntax-repair flows on 42 MATLAB-to-HDL functions implying orchestration still matters by task and model size.	Medium	SE010
CE036	The NVIDIA-portability note reports validated translations from CUDA to OpenCL on Qualcomm Adreno GPUs and from serial CPU kernels to Hexagon Vector Extension NPUs.	Medium	SE009
CE037	The same portability note says generated kernels were validated as correct and early GPU or NPU cases sometimes beat baselines or exceed 100% of target performance.	Medium	SE009
CE038	That portability research also labels the results preliminary and explicitly says reinforcement learning and human-in-the-loop training remain future improvement levers.	Medium	SE009
CE039	Public support evidence includes platform DevOps roles for CI/CD and cloud-plus-on-prem operation along with Solutions Architect and Forward Deployed Engineer roles.	Medium	SE004
CE040	The careers page also lists a Facility Security Officer role implying operational support for security-cleared or government-adjacent delivery contexts.	Medium	SE004
CE041	The news index and linked media-wrapper pages show an active external-facing cadence through February 2026 including Series B CNBC Wired and TBPN surfaces.	Medium	SE013, SE028, SE029, SE030
CE042	Code Metal's privacy policy promises reasonable security measures but explicitly says it cannot guarantee that transmission or storage is 100% secure.	Medium	SE005
CE043	Across the retained public pages reviewed here Code Metal does not disclose a named security certification public uptime or status page or quantified proof-coverage metric for customer deployments.	Medium	SE001, SE002, SE005, SE013
CE044	Official and investor copy repeatedly frames compliance safety and production-readiness as the reasons buyers need verification beyond ordinary AI code generation.	Medium	SE014, SE017, SE018, SE019
CE045	DARPA ARCOS and HACMS show that composable assurance evidence theorem provers model checkers and formal specifications are standard reference points for defense-grade software assurance.	Medium	SE020, SE021
CE046	NIST's AI RMF and its 2026 critical-infrastructure profile note show that trustworthy-AI deployment in critical systems is governed as a risk-management problem not only a model-accuracy problem.	Medium	SE022
CE047	Code Metal's research surface spans LLMLift portability benchmarks UniPar gpuFLOPBench agentic repair studies and MonoCoder indicating a roadmap still shaped by active experimentation across translation benchmarking and specialized models.	Medium	SE007, SE009, SE010, SE011, SE012, SE031
CE048	Public evidence supports a workflow-first product more than a neatly published SKU list so the module map must be reconstructed from product research and hiring surfaces.	Medium	SE002, SE003, SE004
CE049	The strongest publicly evidenced integration surfaces are IDE plugins version-control friendliness configurable build targets and customer-facing support roles rather than self-serve API or marketplace documentation.	Medium	SE002, SE004
CE050	Code Metal's differentiation today rests on the combined promise of hardware portability formal verification and academic compiler lineage rather than on breadth of public compliance certifications.	Medium	SE014, SE018, SE019, SE025, SE026, SE027
CU001	Code Metal's homepage names industrial, automotive, semiconductor, defense, and robotics as visible industry surfaces.	Medium	SU001
CU002	The product page shows buyers configuring CPUs, GPUs, FPGAs, and toolchains, which points to hardware-aware deployment work rather than generic code assistance.	Medium	SU002
CU003	Open roles for a Facility Security Officer, Forward Deployed Engineer, and Principal Solutions Architect indicate a customer motion that includes security review and hands-on deployment support.	Medium	SU003
CU004	By February 2026, Code Metal said its technology was already deployed across defense, automotive, semiconductor, and other mission-critical industries.	High	SU004, SU005
CU005	Code Metal publicly named Toshiba, RTX, L3Harris, and the U.S. Air Force as customers in its February 2026 Series B materials.	High	SU004, SU005
CU006	Salesforce Ventures said demand had already pulled Code Metal into programs of record across the U.S. Air Force and L3Harris.	High	SU004, SU005, SU006
CU007	B Capital said Code Metal had won customers including the U.S. Air Force, L3Harris, Toshiba, and RTX.	Medium	SU007
CU008	Wired reported that early customers included L3Harris, RTX, and the U.S. Air Force, and that the company was also working with Toshiba.	Medium	SU009
CU009	The July 2024 seed announcement said Code Metal already had strategic partnerships with X-Press Feeders and L3Harris and was generating revenue.	Medium	SU011
CU010	The seed announcement included a direct outside quote from HICO's Chris Hartnoll describing Code Metal as transformational in building intelligence into a logistics network.	Medium	SU011
CU011	In November 2025, Code Metal said it was already on contract to deliver eight figures in revenue that year.	High	SU010, SU008
CU012	Shield Capital said Code Metal had sped code translation from weeks to days for L3Harris across several projects.	Medium	SU010
CU013	GeekWire reported Ryan Aytay's move to Code Metal and quoted him describing rare real customer demand.	Medium	SU014
CU014	Accel describes Code Metal as AI developer tools for edge environments and says its initial investment was the 2025 Series A.	Medium	SU013
CU015	The February 2026 Series B materials said the new funding would expand commercial and government partnerships and scale go-to-market capabilities.	High	SU004, SU005
CU016	B Capital explicitly framed telecommunications, semiconductor manufacturers, automotive, industrial equipment, and other regulated spaces as target enterprise domains beyond defense.	Medium	SU007
CU017	The combination of homepage industry messaging and product-level hardware configuration supports a buyer set that includes semiconductor and platform teams, not just defense software groups.	Medium	SU001, SU002
CU018	DARPA's HACMS program is official evidence that high-assurance military software remains a real procurement and technical requirement in defense contexts.	Medium	SU017
CU019	NIST's AI Risk Management Framework is official evidence that high-stakes AI deployments carry structured risk and governance expectations.	Medium	SU018
CU020	The LLMLift paper describes porting code to DSLs for GPUs, machine-learning accelerators, and network processors as a hard problem where manual rewriting is bug-prone.	Medium	SU015
CU021	The Tenspiler paper independently reinforces that verified lifting for specialized tensor and hardware-oriented targets is a real software-engineering need.	Medium	SU024
CU022	Karpathy's May 2026 post described rising momentum behind porting C to Rust and upgrading legacy code bases, which supports broad market pull for translation work beyond one company.	Medium	SU021
CU023	Code Metal's public customer proof is dominated by official releases, investor theses, and media-amplification posts rather than by a large set of direct customer case studies.	High	SU004, SU005, SU006, SU007, SU022, SU023
CU024	The retained public sources do not disclose a total customer count or an active-account denominator.	High	SU004, SU005, SU006, SU007, SU008, SU009, SU010
CU025	The retained public sources do not disclose NRR, GRR, churn, contract length, or renewal rates.	High	SU004, SU005, SU006, SU007, SU008, SU009, SU010, SU014
CU026	The role mix, partnership-expansion language, and COO scale-up imply a long-cycle, high-touch enterprise and government sales motion.	High	SU003, SU004, SU005, SU014
CU027	If customers complete hardware-specific integration and verification work with Code Metal, the resulting deployments are likely to be sticky because re-validation across chips and toolchains is costly.	High	SU002, SU015, SU017, SU018, SU024
CU028	The best public repeat-use proxies are programs-of-record language for the U.S. Air Force and L3Harris and the description of several L3Harris projects.	High	SU005, SU006, SU010
CU029	Public proof for the U.S. Air Force is material but incomplete because the retained pack names the customer without giving contract IDs, award dollars, or direct program documents.	High	SU004, SU005, SU006
CU030	Public proof for L3Harris is stronger than generic logo use because the pack includes programs-of-record language and a several-projects performance claim.	High	SU005, SU006, SU010
CU031	RTX is both a named customer and an investor, which weakens its value as an independent reference account.	High	SU004, SU005, SU007
CU032	Toshiba is the clearest public industrial or electronics logo, but the retained pack provides no public outcome metric or direct user quote for that account.	High	SU004, SU005, SU007, SU009
CU033	X-Press Feeders and the HICO-linked quote provide the clearest customer-quoted proof surface in the public pack, but that evidence dates to 2024 and predates the later defense-heavy narrative.	Medium	SU011
CU034	Secondary coverage repeatedly recycles the same small named-customer set rather than surfacing many new reference accounts.	Medium	SU009, SU019, SU020, SU025
CU035	A low-tier secondary article relayed management's claim that every deployed pilot goes to the next phase and that the company is profitable, but it did not provide customer-level proof.	Low	SU019
CU036	Wired reported that Code Metal negotiates pricing individually based on development time, lines translated, or time saved.	Medium	SU009
CU037	Wired reported that Code Metal was in talks with a large unnamed chip company about portability across chip platforms.	Medium	SU009
CU038	AICOSoft and Give Me Technology both framed Code Metal around the defense problem of old mission software and scarce legacy-language talent.	Medium	SU019, SU020
CU039	Official and investor sources name top-tier government and enterprise customers, but they do not disclose whether those relationships are direct prime contracts, subcontracted work, or scoped pilots.	High	SU004, SU005, SU006, SU007
CU040	Across official and investor sources, Code Metal's expansion thesis spans defense, automotive, semiconductor, telecom, industrial equipment, and other regulated industries.	High	SU004, SU005, SU007, SU010
CU041	Independent customer reviews, public satisfaction scores, and detailed third-party case studies were not found in the retained customer pack.	Medium	SU001, SU022, SU023
CU042	Customer durability is therefore still a diligence hypothesis rather than a public metric-backed fact.	Medium	SU004, SU005, SU009, SU014
CU043	The combination of support-oriented roles and a new enterprise operator suggests the customer base still needs meaningful onboarding and account management attention.	High	SU003, SU004, SU014
CU044	Customer concentration risk is high in public evidence because the named logo list is short and heavily weighted toward defense and mission-critical accounts.	High	SU004, SU005, SU006, SU007, SU008, SU009
CU045	The SEC filing page adds financing chronology but no customer operating detail, underscoring how little third-party primary documentation exists for customer economics.	Medium	SU012
CU046	Official and investor materials consistently frame Code Metal as valuable where software failure has compliance, safety, or mission consequences.	High	SU004, SU005, SU007, SU016
CU047	Code Metal's official web surface is more focused on fundraising, research, and press amplification than on a deep library of customer stories.	High	SU001, SU004, SU022, SU023
CU048	J2 Ventures' companies page and Shield Capital's podcast hub show that additional accessible third-party surfaces around Code Metal still sit inside investor-owned national-security and startup ecosystems rather than inside customer-authored proof channels.	Medium	SU005, SU026, SU027
CU049	The retained USAspending search page is an official awards-search surface, but it does not itself disclose a Code Metal award record, contract vehicle, or funded program in the prepared pack.	Low	SU028
CU050	The retained SAM.gov entity-search page likewise does not itself expose an entity detail page, contract trail, or procurement structure for Code Metal in the prepared pack.	Low	SU029
CU051	Even after checking retained USAspending and SAM search pages, independent procurement verification remains incomplete because those public search surfaces do not tie Code Metal's Air Force or defense-prime claims to award-level paper.	Medium	SU006, SU028, SU029
CR001	Code Metal publicly positions itself for defense, automotive, semiconductor, industrial, and robotics contexts where software failure carries high consequences.	Medium	SR001, SR002
CR002	Public materials repeatedly frame the product as verifiable code translation rather than a generic code copilot.	Medium	SR001, SR006
CR003	By February 2026, public company and independent sources named Toshiba, RTX, L3Harris, and the U.S. Air Force as customers.	Medium	SR010, SR011, SR012, SR015
CR004	Publicly disclosed backers by the Series B included Salesforce Ventures, Accel, B Capital, Smith Point, J2 Ventures, Shield Capital, Overmatch, and RTX.	Medium	SR010, SR011
CR005	Code Metal announced a $36.5 million Series A at a $250 million valuation in November 2025.	Medium	SR009, SR014
CR006	Code Metal announced a $125 million Series B at a $1.25 billion valuation in February 2026.	Medium	SR010, SR011, SR015
CR007	The jump from the Series A to the Series B compressed the burden of proof onto future execution because valuation scaled faster than public operating disclosure.	Medium	SR011, SR014, SR015
CR008	The EU AI Act subjects high-risk AI uses and GPAI models to documentation, risk-management, transparency, copyright, and safety-security obligations.	Medium	SR024
CR009	The AI Act explicitly treats AI safety components in critical infrastructure as high-risk, which aligns uncomfortably with Code Metal's mission-critical positioning.	Medium	SR001, SR024
CR010	The AI Act's transparency rules requiring AI-generated content identifiability take effect in August 2026.	Medium	SR024
CR011	EU compliance support now includes a GPAI Code of Practice and a training-content summary template, which increases documentation burden around model provenance.	Medium	SR024
CR012	The U.S. Copyright Office released a pre-publication Part 3 report on generative AI training in May 2025, underscoring unresolved training-data policy scrutiny.	Medium	SR025
CR013	Code Metal's privacy policy covers website-level personal and usage data collection but does not publicly resolve how customer code, model-training boundaries, or retention operate in production deployments.	Medium	SR004, SR002
CR014	Public sources reviewed did not disclose a litigation history, patent portfolio, or licensed-data framework that would close provenance questions around academic and model inputs.	Low	SR004, SR025, SR026, SR028
CR015	Verified Code Transpilation with LLMs states that prior LLM transpilation approaches lacked functional-correctness guarantees, which is the technical gap Code Metal claims to solve.	Medium	SR026
CR016	The LLMLift paper describes generating proofs of functional equivalence across four DSLs, showing real assurance progress but within bounded source-target domains.	Medium	SR026
CR017	Tenspiler shows verified lifting can span six DSLs and hardware or software environments, but expansion still depends on explicit intermediate representations and backend rules.	Medium	SR027
CR018	Code Metal's product flow asks users to specify target CPUs, GPUs, FPGAs, toolchains, and resource limits before translation begins.	Medium	SR002
CR019	Code Metal's NVIDIA portability research says conventional migration can require scarce architecture experts and weeks of tuning per kernel.	Medium	SR034
CR020	Counting Without Running reports that modern LLMs do well on simple kernels but struggle on complex reasoning cases involving division, math functions, and shared subexpressions.	Medium	SR008
CR021	Those benchmark limits imply that performance-critical or hardware-specific generated code still needs human validation and measurement rather than trusting model reasoning alone.	Medium	SR008, SR002
CR022	The careers page lists a Facility Security Officer, Forward Deployed Engineer, Principal Solutions Architect, and multiple senior compiler or formal-methods roles.	Medium	SR003
CR023	That hiring mix implies a high-touch delivery model with security handling, customer-specific integration, and ongoing support obligations.	Medium	SR003, SR002
CR024	DARPA says current DoD software certification practices are antiquated and do not scale, making automated assurance evidence economically important but slow to institutionalize.	Medium	SR018
CR025	HACMS describes a defense context that values resilient, high-assurance cyber systems even against zero-day exploits, which raises the bar for generated-code acceptance.	Medium	SR019
CR026	NIST's AI RMF and CISA's memory-safe-language guidance show public-sector software trust standards are moving toward documented risk management and safer development practices.	Medium	SR020, SR021
CR027	Salesforce Ventures said demand had already pulled Code Metal into programs of record across the U.S. Air Force and L3Harris.	Medium	SR012
CR028	Because the public customer list remains short, concentration risk cannot be dismissed from retained evidence alone.	Medium	SR011, SR012, SR015
CR029	RTX is publicly visible as both an investor and a customer, creating overlap between commercial validation and strategic-capital support.	Medium	SR009, SR011, SR015
CR030	J2 Ventures and Shield Capital both frame themselves around national-security technology ecosystems, reinforcing the defense-network character of the cap table.	Medium	SR030, SR031, SR011
CR031	Public federal search pages retained for USAspending and SAM do not by themselves confirm a direct prime-award history or clean entity-disclosure trail for Code Metal.	Low	SR032, SR033
CR032	The hardware-portability thesis depends on heterogeneous compute and toolchain pain staying severe enough that buyers prefer Code Metal over manual rewrites or internal tooling.	Medium	SR002, SR026, SR034
CR033	Founder background and launch materials tie Peter Morales and Alex Showalter-Bucher to MIT Lincoln Laboratory and defense-software work.	Medium	SR009, SR014
CR034	UCSD professor Loris D'Antoni publicly identifies himself as a Scholar at Code Metal.	Medium	SR029
CR035	The LLMLift paper authorship ties core verified-lifting ideas to UC Berkeley researchers and industry labs such as Intel Labs and Duolingo, showing valuable but not obviously exclusive academic lineage.	Medium	SR026, SR028
CR036	Alvin Cheung's Berkeley page says verified-lifting work from his group is deployed at Adobe and Google, suggesting the underlying techniques are broader than any single startup.	Medium	SR028
CR037	Public leadership visibility improved with Ryan Aytay's arrival as President and COO, but the public record remains thin on board composition and named bench depth below the founders.	Medium	SR016, SR005
CR038	Recruiting formal-methods, compiler, ASIC-verification, and forward-deployed talent is likely difficult because the role mix is unusually specialized.	Medium	SR003, SR026
CR039	Public sources do not disclose ARR, gross margin, burn, cash on hand, customer-count denominator, or retention metrics.	Medium	SR010, SR011, SR014, SR015
CR040	The Series A statement about eight-figure contracted revenue is a demand signal, not proof that recurring software revenue already dominates delivery services.	Medium	SR009, SR014
CR041	The combination of hardware-specific configuration, forward-deployed hiring, and bounded research domains suggests the services-to-platform conversion is still in progress.	Medium	SR002, SR003, SR026
CR042	SEC EDGAR shows Form D activity in 2023, 2024, twice in 2025, and 2026, confirming repeated capital formation.	Medium	SR017
CR043	Repeated financing materially improves near-term survivability but does not by itself disclose runway or cash efficiency.	Medium	SR017, SR011, SR015
CR044	Defense-oriented procurement and acceptance cycles can delay revenue recognition and expansion even when technical evaluations are positive.	Medium	SR012, SR018, SR022, SR023
CR045	Code Metal's mitigation story is strongest on formal verification, proof generation, and mission-specific investor access rather than on published production reliability metrics.	Medium	SR001, SR006, SR026, SR011
CR046	The most important monitor is whether named reference accounts expand into broader recurring deployments rather than remain a small set of high-effort flagship programs.	Medium	SR011, SR012, SR015
CR047	A thesis-break event would be evidence that certification, integration, or security review effort offsets Code Metal's claimed speed and portability advantages.	Medium	SR018, SR019, SR020, SR021
CR048	A second thesis-break event would be a financing reset before public software metrics catch up to the $1.25 billion valuation.	Medium	SR011, SR015, SR017
CR049	A third thesis-break event would be loss of a founder or key research affiliate without visible succession depth.	Medium	SR016, SR026, SR029
CR050	Procurement verification remains a diligence ask because retained public records do not disclose contract vehicle, award dollar amount, or whether programs sit on prime or subcontract paper.	Low	SR012, SR032, SR033
CR051	The European Commission's AI Act FAQ describes a uniform EU-wide regime with evolving high-risk implementation guidance, implying Code Metal's compliance burden is ongoing rather than a one-time localization exercise.	Medium	SR024, SR035
CR052	The Copyright Office's January 2025 copyrightability report says purely AI-generated material is not copyrightable without sufficient human control and leaves training, licensing, and liability to a subsequent report, keeping output ownership and indemnity questions open.	Medium	SR025, SR036
CR053	GAO's 2026 review found federal agencies struggle to access AI technical experts, estimate AI-related costs, and often acquire AI as an ongoing service, all of which can lengthen procurement and increase support expectations.	Medium	SR037
CR054	GAO's 2023 DOD report found AI acquisitions lacked department-wide guidance and highlighted intellectual-property and data-rights concerns in AI contracting, raising contract-review burden for mission software vendors.	Medium	SR038
CR055	CISA's Secure by Design program argues technology providers should bear more of the cybersecurity burden and ship secure-by-default products, which raises the evidence bar for vendors selling AI-generated code into critical workflows.	Medium	SR039
CR056	CISA explicitly says AI is software and must be Secure by Design across design, deployment, vulnerability management, incident management, and end-of-life, broadening the diligence scope beyond model-output quality alone.	Medium	SR040
CV001	Code Metal announced a $125 million Series B at a $1.25 billion valuation in February 2026.	High	SV003, SV004
CV002	Independent pickup coverage repeated the $125 million Series B and $1.25 billion valuation headline, but did not add audited operating metrics behind the price.	Medium	SV004, SV029, SV030
CV003	Code Metal announced a $36.5 million Series A at a $250 million valuation in November 2025.	High	SV001, SV002
CV004	CNBC independently covered the Series A as roughly $36 million led by Accel, which corroborates the round while showing normal press-rounding on the amount.	Medium	SV001, SV002
CV005	The disclosed valuation stepped up about 5x from $250 million at Series A to $1.25 billion at Series B in roughly three months.	High	SV001, SV002, SV003, SV004
CV006	The publicly disclosed pre-seed, seed, Series A, and Series B rounds sum to about $177.95 million of visible financing.	Medium	SV031, SV001, SV003, SV004
CV007	Code Metal positions itself as a verifiable code-translation platform for mission-critical systems rather than as a generic coding copilot.	High	SV024, SV025
CV008	By the Series B, public materials named Toshiba, RTX, L3Harris, and the U.S. Air Force as customers or deployed accounts.	High	SV003, SV004, SV029, SV030
CV009	Series A materials said Code Metal was already on contract to deliver eight figures in revenue that year.	High	SV001, SV002
CV010	No retained public source discloses Code Metal’s ARR, gross margin, burn, cash balance, net revenue retention, or total customer count.	Medium	SV001, SV003, SV004, SV005, SV006, SV007, SV008
CV011	Retained public sources do not disclose liquidation preferences, secondary mix, debt, or other cap-table overhang terms for the Series B.	Medium	SV003, SV004, SV008
CV012	The current price is therefore supported more by investor quality, customer logos, and narrative conviction than by public operating disclosure.	Medium	SV005, SV006, SV007, SV008, SV029, SV030
CV013	Large adjacent modernization and application-security budget pools mean a real mission-critical translation wedge could scale if Code Metal proves repeatability.	Medium	SV026, SV027
CV014	Code Metal’s product and research materials emphasize hardware portability and formal verification, which differentiates the offering from broad AI coding assistants.	Medium	SV024, SV025, SV006, SV007
CV015	Public evidence does not yet prove that buyers will consistently pay a premium for verified translation over broader modernization or security tooling.	Medium	SV005, SV017, SV019, SV020
CV016	The relevant comparable set spans adjacent developer-security platforms, high-assurance specialists, modernization incumbents, and defense-services substitutes rather than a single exact peer group.	Medium	SV012, SV015, SV017, SV019, SV020, SV022, SV023
CV017	Snyk provides an upside boundary for scaled developer tooling because it reportedly reached $300 million ARR while its most recent disclosed private valuation remained $7.4 billion.	Medium	SV009, SV011
CV018	Snyk’s 2024 growth slowdown to 26% revenue growth, coupled with a large operating loss, shows that even scaled developer platforms can face valuation compression and delayed IPO timing.	Medium	SV009, SV010
CV019	Sonar shows what investors can pay for a broad code-quality platform when scale is visible: $412 million raised at a $4.7 billion valuation with an explicit path toward $1 billion in revenue.	High	SV012, SV013, SV014
CV020	Diffblue is a smaller AI-for-code reference point, raising $6.3 million in 2024 even while reporting 326% net new ARR growth from the prior six months.	Medium	SV015, SV016, SV028
CV021	IBM, Booz Allen, and SAIC are not price comps for Code Metal, but they are budget competitors with broader enterprise or federal modernization distribution.	Medium	SV017, SV018, SV019, SV020, SV021
CV022	GrammaTech and Galois reinforce that assurance credibility can also come from established vendors with much longer public track records than Code Metal.	Medium	SV022, SV023
CV023	The bull case requires Code Metal to convert bespoke migrations into reusable software while expanding beyond a short list of public reference logos.	Medium	SV003, SV004, SV024, SV025
CV024	The base case assumes Code Metal keeps winning relevant programs but remains partly services-led and still discloses only limited public unit economics.	Medium	SV001, SV002, SV005, SV008
CV025	The bear case is a flat or down round if private markets reprice AI code infrastructure before Code Metal can show repeatable software economics.	Medium	SV009, SV010, SV011, SV018
CV026	Because revenue is undisclosed, a multiple-based valuation model would imply false precision, so scenario ranges are more defensible than direct revenue-multiple math.	Medium	SV001, SV003, SV004, SV005, SV008
CV027	A defensible bull range is roughly $1.5 billion to $2.4 billion if Code Metal proves audited growth, better-than-services gross margins, and broader customer independence.	Medium	SV003, SV004, SV009, SV012, SV016
CV028	A defensible base range is roughly $0.9 billion to $1.4 billion if growth continues but evidence still points to a mixed product-and-services model.	Medium	SV003, SV004, SV015, SV016, SV022, SV023
CV029	A defensible bear range is roughly $0.4 billion to $0.8 billion if financial opacity persists and the company is valued more like a narrow high-assurance contractor than a scaled platform.	Medium	SV010, SV011, SV015, SV019, SV020
CV030	Downside triggers include failure to evidence recurring software revenue, worsening customer concentration, weak gross-margin improvement, or procurement delays that stall expansion.	Medium	SV005, SV008, SV009, SV010
CV031	At the disclosed $1.25 billion price, the evidence-constrained recommendation is research-more rather than buy.	Medium	SV005, SV008, SV009, SV010, SV012
CV032	Recommendation confidence should be medium because financing facts are well corroborated but operating KPIs remain mostly private.	Medium	SV001, SV003, SV004, SV005, SV008
CV033	Risk rating should be high because valuation opacity, concentration questions, and services-vs-software uncertainty still dominate underwriting risk.	Medium	SV005, SV008, SV010, SV021
CV034	Valuation stance should be expensive because today’s price already assumes platform-scale outcomes that public evidence has not yet demonstrated.	Medium	SV003, SV004, SV009, SV012, SV015
CV035	Entry discipline should require either a materially lower entry valuation or a management data room that proves ARR, gross margin, revenue mix, and customer diversification.	Medium	SV008, SV009, SV010, SV021
CV036	Thin third-party validation is itself adverse evidence because much of the bullish valuation narrative comes from company and investor materials plus pickup coverage rather than audited metrics.	Medium	SV005, SV006, SV007, SV029, SV030
CV037	RTX being both an investor and a named customer weakens the independence of one marquee proof point.	Medium	SV003, SV004, SV006, SV029
CV038	Exit readiness for a public-market process is not supportable from public evidence because Code Metal discloses far less financial detail than public comparable vendors reveal.	Medium	SV009, SV010, SV012, SV018, SV021
CV039	Final diligence should focus first on revenue quality, cap-table terms, backlog versus services mix, and customer concentration because those items would most change the underwriting conclusion.	Medium	SV008, SV010, SV021, SV031
CV040	A thesis-break condition is failure to turn named program momentum into repeatable software economics before the next major financing event.	Medium	SV001, SV003, SV009, SV010, SV015, SV016
CV041	If management can produce board-level KPI evidence consistent with a Snyk-like or Sonar-like trajectory, current skepticism could soften materially.	Medium	SV009, SV012, SV016
CV042	If the business remains delivery-heavy, Code Metal belongs closer to niche modernization or technical-services outcomes than to elite developer-platform valuations.	Medium	SV015, SV019, SV020, SV022, SV023
CV043	Probability should skew toward base and bear rather than bull because present evidence is stronger on financing velocity than on monetization depth.	Medium	SV005, SV008, SV009, SV010, SV012, SV015
CV044	A zero-value interpretation would be too harsh because technical differentiation, named customers, and repeated financing are all corroborated in retained evidence.	Medium	SV003, SV004, SV024, SV025, SV029

Sources
ID	Publisher	Title	Quote
SO001	Code Metal	Homepage	Verifiable code translation for industries where every line of code matters.
SO002	Code Metal	About Us
SO003	Code Metal	Careers
SO004	Code Metal	Product
SO005	Code Metal	AI Code That Works — and Proves It	At Code Metal, this is the central problem we are trying to solve: building AI-driven code translation systems for domains where correctness is non-negotiable.
SO006	Code Metal	Verified Code Transpilation with LLMs
SO007	Code Metal	Code Metal Secures $16.5M in Seed Funding	Code Metal ... announced a $13M seed, led by Shield Capital, and a prior $3.45M pre-seed round, led by J2 Ventures.
SO008	Code Metal / hosted launch article	Researchers launch Code Metal, Boston startup using AI	The idea from cofounders Peter Morales and Alex Showalter-Bucher is that developers will write software in a common programming language like Python and then Code Metal will quickly translate it to run on a particular device's hardware chip.
SO009	Code Metal	Code Metal Raises $36.5 Million for Verifiable AI-Powered Code Translation	Code Metal today announced it has closed $36.5 million of funding for its Series A round ... The round was led by venture capital firm Accel and values the company at $250 million.
SO010	Code Metal	Code Metal Closes $125 Million Series B	Code Metal ... announced the close of its $125 million Series B financing led by Salesforce Ventures ... The company also announced that Ryan Aytay ... has joined Code Metal as President and Chief Operating Officer.
SO011	Code Metal	Forbes Covers Code Metal
SO012	Code Metal	Code Metal's Metal Ops Hackathon Unveils Cutting-Edge Smart City Concepts for USSOCOM	The Metal Ops: Smart City Hackathon, hosted by Code Metal in downtown Boston from March 14-16, 2025, has successfully concluded.
SO013	BusinessWire	Code Metal Secures $125M Series B at $1.25B Valuation to Bridge the Trust Gap in AI Code Generation	Customers, including Toshiba, RTX, L3Harris, and the U.S. Air Force, use Code Metal to move between programming languages and optimize software for hardware at unprecedented speed.
SO014	Salesforce Ventures	Verified Code for Mission-Critical Systems	Their team — led by co-founders Peter Morales and Alex Showalter-Bucher ... has assembled the deepest private-sector bench in formal methods and safety-critical systems.
SO015	B Capital	Translating Code When Failure is Not an Option: Why We Invested in Code Metal	This hybrid approach creates a translation and optimization engine reliable enough for defense and industrial customers, and engineers trust it in production.
SO016	CNBC	AI startup Code Metal is going beyond vibe coding with the help of $36 million in fresh capital
SO017	Wired	Code Metal Raises $125 Million to Rewrite the Defense Industry's Code With AI	While some of the methodologies behind their technology remain unproven, investors are willing to gamble that at least a few will pan out.
SO018	GeekWire	Tech Moves: ... ex-Tableau CEO lands at Code Metal	Former Tableau CEO Ryan Aytay is the new president and chief operating officer of Code Metal, a Boston-based company...
SO019	MassRobotics	Code Metal Raises $125 Million to Rewrite the Defense Industry's Code With AI
SO020	Tech Funding News	Code Metal raises $125M: Can neuro-symbolic AI close the trust gap in mission-critical software?
SO021	U.S. Securities and Exchange Commission	EDGAR search results for Code Metal Form D filings
SO022	Smith Point Capital	Smith Point Capital \| Home
SO023	Overmatch VC	Portfolio
SO024	University of California San Diego	Loris D'Antoni	I'm a professor in the Programming Systems Group and a Scholar at Code Metal.
SO025	arXiv	Verified Code Transpilation with LLMs
SO026	Metalift	Metalift · A program synthesis framework for verified lifting applications
SM001	Code Metal	Code Metal - Verifiable Code Translation	Choose your edge environment configuration - pick a CPU ... GPUs ... FPGAs ... and preferred toolchains.
SM002	Code Metal	Code Metal - Verifiable Code Translation	Testing can show the presence of bugs, but never their absence.
SM003	Code Metal	The Real Cost of Leaving NVIDIA: What Automated Transpilation Actually Costs	The findings below show how Code Metal’s automated and verifiable transpilation platform can achieve close to expert-level performance.
SM004	Code Metal	Verified Code Transpilation with LLMs
SM005	Code Metal	Code Metal - Verifiable Code Translation
SM006	Code Metal	Code Metal - Verifiable Code Translation
SM007	Code Metal	Workflows vs Agents for Code Translation
SM008	Code Metal	UniPar: A Unified LLM-Based Framework for Parallel and Accelerated Code Translation in HPC
SM009	Code Metal	Code Metal - Verifiable Code Translation	With its technology already deployed across defense, automotive, semiconductor, and other mission-critical industries...
SM010	Salesforce Ventures	Verified Code for Mission-Critical Systems
SM011	B Capital	Translating Code When Failure is Not an Option: Why We Invested in Code Metal
SM012	DARPA	ARCOS \| DARPA	Current certification practices within the Department of Defense are antiquated and unable to scale with the amount of software deployed.
SM013	DARPA	High-Assurance Cyber Military Systems (HACMS)
SM014	NIST	AI Risk Management Framework
SM015	CISA	Memory Safe Languages: Reducing Vulnerabilities in Modern Software Development \| CISA
SM016	National Security Agency	NSA and CISA Release CSI Highlighting Importance of Memory Safe Languages in Software Security
SM017	Department of Defense Chief Information Officer	Software Modernization Implementation Plan FY25 – 26
SM018	Department of Defense	DoDI 5000.87 Operation of the Software Acquisition Pathway
SM019	Grand View Research	Application Modernization Services Market Size Report, 2030
SM020	Mordor Intelligence	Application Security Market Size, Scope, Demand Report 2031
SM021	Sonar	Code Verification for the AI Era
SM022	GrammaTech	GrammaTech \| Software Assurance & Cyber-Security Solutions
SM023	Galois	Galois - Home
SM024	IBM	AI coding agent \| IBM
SM025	arXiv	Security Weaknesses of Copilot-Generated Code in GitHub Projects: An Empirical Study	Our analysis identified 733 snippets, revealing a high likelihood of security weaknesses.
SM026	Booz Allen	Artificial Intelligence
SM027	SAIC	SAIC \| Data and Artificial Intelligence
SM028	Diffblue	Diffblue — The AI Testing Agent for Enterprise Unit Testing - Diffblue
SM029	Software Engineering Institute	Software Engineering Institute \| CMU Software Engineering Institute
SM030	Snyk	Snyk AI Security Fabric \| Secure Code, Models & Agents \| Snyk
SP001	Code Metal	Code Metal - Verifiable Code Translation
SP002	Code Metal	Code Metal - Verifiable Code Translation
SP003	Code Metal	Code Metal - Verifiable Code Translation
SP004	Code Metal	Verified Code Transpilation with LLMs
SP005	arXiv	Verified Code Transpilation with LLMs
SP006	Code Metal	The Real Cost of Leaving NVIDIA: What Automated Transpilation Actually Costs
SP007	Business Wire	Code Metal Secures $125M Series B at $1.25B Valuation to Bridge the Trust Gap in AI Code Generation
SP008	Salesforce Ventures	Verified Code for Mission-Critical Systems
SP009	B Capital	Translating Code When Failure is Not an Option: Why We Invested in Code Metal
SP010	DARPA	ARCOS \| DARPA
SP011	DARPA	High-Assurance Cyber Military Systems (HACMS)
SP012	NIST	AI Risk Management Framework
SP013	NIST	Artificial intelligence
SP014	GrammaTech	GrammaTech \| Software Assurance & Cyber-Security Solutions
SP015	GrammaTech	About GrammaTech \| GrammaTech
SP016	Galois	Galois - Home
SP017	Diffblue	Diffblue — The AI Testing Agent for Enterprise Unit Testing - Diffblue
SP018	Diffblue	Diffblue Cover - Diffblue
SP019	Snyk	Snyk AI Security Fabric \| Secure Code, Models & Agents \| Snyk
SP020	Snyk	Snyk Code \| SAST Code Scanning Tool \| Code Security Analysis & Fixes \| Snyk
SP021	Sonar	Code Verification for the AI Era
SP022	IBM	AI coding agent \| IBM
SP023	IBM	IBM watsonx
SP024	Booz Allen Hamilton	Artificial Intelligence
SP025	SAIC	SAIC \| Data and Artificial Intelligence
SP026	Metalift	Metalift · A program synthesis framework for verified lifting applications
SP027	arXiv	Security Weaknesses of Copilot-Generated Code in GitHub Projects: An Empirical Study
SP028	WIRED	Code Metal Raises $125 Million to Rewrite the Defense Industry's Code With AI
SP029	Diffblue	About - Diffblue
SP030	GrammaTech	Learn About Cybersecurity & Software Assurance \| GrammaTech
SP031	Sonar	SonarQube
SP032	IBM	AI coding agent \| IBM
SI001	Code Metal	Code Metal - Verifiable Code Translation	Verifiable code translation for industries where every line of code matters
SI002	Code Metal	Code Metal Product	Choose your edge environment configuration - pick a CPU ... and a combination of accelerators including GPUs ... FPGAs ... and preferred toolchains.
SI003	Code Metal	Code Metal Careers	Facility Security Officer ... Vice President of Finance ... Forward Deployed Engineer ... Principal Solutions Architect
SI004	Code Metal	AI-Generated Code That Works — and Proves It	The question is not merely whether software appears to work during testing, but whether we can establish stronger guarantees about all possible behaviors of the system.
SI005	Code Metal	Verified Code Transpilation with LLMs	LLMLift combines LLM-powered translation with proof generation to create formal proofs establishing functional equivalence.
SI006	Code Metal	The Real Cost of Leaving NVIDIA: What Automated Transpilation Actually Costs	The conventional approach requires finding kernel engineers who know the target architecture, budgeting weeks per kernel to port the code, then budgeting weeks more to tune it.
SI007	Code Metal	Combining AI with Formal Verification for Efficient Migration of Legacy Code	LLMLift automatically translates input programs ... into different target languages ... and develops a formal verification based method to verify LLM outputs.
SI008	Code Metal	Workflows vs Agents for Code Translation	The paper compares two LLM-driven approaches for syntax repair in MATLAB-to-HDL translation.
SI009	Code Metal	Code Metal raises $36.5 million for verifiable AI-powered code translation	The raise marks a new phase of growth for Code Metal ... and is already on contract to deliver eight figures in revenue this year.
SI010	Code Metal	Code Metal closes $125 million Series B	Customers, including Toshiba, RTX, L3Harris, and the U.S. Air Force, use Code Metal to move between programming languages and optimize software for hardware at unprecedented speed.
SI011	Business Wire	Code Metal Secures $125M Series B at $1.25B Valuation to Bridge the Trust Gap in AI Code Generation	In just one year, demand has pulled Code Metal into programs of record across the U.S. Air Force, L3Harris, and more.
SI012	Salesforce Ventures	Verified Code for Mission-Critical Systems	Manual code rewrites are slow, risky, and unscalable.
SI013	B Capital	Translating Code When Failure is Not an Option: Why We Invested in Code Metal	In domains like defense and aerospace, “good enough” is not good enough.
SI014	CNBC	AI startup Code Metal is going beyond vibe coding with the help of $36 million in fresh capital
SI015	WIRED	Code Metal Raises $125 Million to Rewrite the Defense Industry’s Code With AI	While some of the methodologies behind their technology remain unproven, investors are willing to gamble that at least a few will pan out.
SI016	Tech Funding News	Code Metal raises $125M: Can neuro-symbolic AI close the trust gap in mission-critical software?
SI017	MassRobotics	Code Metal Raises $125 Million to Rewrite the Defense Industry’s Code With AI	The startup is also working with Japanese electronics company Toshiba and says it’s in talks with a large chip company ... though the company declined to say which one.
SI018	Securities and Exchange Commission	EDGAR Search Results: Code Metal Form D filings	Acc-no: 0002001452-26-000001 ... 2026-03-12 ... 0002001452-25-000003 ... 2025-11-13 ... 0002001452-24-000001 ... 2024-08-01
SI019	DARPA	ARCOS \| DARPA	Current certification practices within the Department of Defense are antiquated and unable to scale with the amount of software deployed.
SI020	DARPA	High-Assurance Cyber Military Systems (HACMS)	HACMS will adopt a clean-slate, formal methods-based approach ... capable of producing a machine-checkable proof.
SI021	NIST	AI Risk Management Framework	The AI RMF is intended ... to improve the ability to incorporate trustworthiness considerations into the design, development, use, and evaluation of AI products, services, and systems.
SI022	Code Metal	UniPar: A Unified LLM-Based Framework for Parallel and Accelerated Code Translation in HPC	Their approach ... improved performance from 46% to 69% compilation success and 15% to 33% functional correctness.
SI023	Code Metal	MONOCODER: Domain-Specific Code Language Model for HPC Codes and Tasks	Specialized, domain-focused models can achieve better results with fewer parameters than general-purpose alternatives.
SI024	Code Metal	Research & Insights	Explore our latest research on AI, edge computing, MLOps, and enterprise technology solutions.
SI025	Code Metal	Code Metal secures $16.5M in seed funding	Code Metal ... announced a $13M seed, led by Shield Capital, and a prior $3.45M pre-seed round, led by J2 Ventures.
SI026	MLQ.ai	Code Metal Lands $125M Series B to Modernize Legacy Defense Software with AI
SI027	AITech365	Code Metal Raises $125M Series B for Verifiable AI
SI028	Intelligence360	Code Metal Secures $125M Series B at $1.25B Valuation to Bridge the Trust Gap in AI Code Generation
SI029	TechNews180	Code Metal Raises $125M Series B at $1.25B Valuation
SI030	ai2.work	Code Metal's $125M Series B Bets AI Can Verify Legacy Code at Scale
SI031	UBOS	Code Metal Secures $125 Million Series B to Accelerate AI‑Driven Defense Software
SE001	Code Metal	Code Metal - Verifiable Code Translation	Where AI’s generalization meets program analysis’s precision. Uniting high-level reasoning with low-level verification to produce tested optimized and compliant code.
SE002	Code Metal	Code Metal — Verifiable Code Translation	Load your high level reference code written in Python Matlab or Julia in your favorite IDE with the CodeMetal plugin installed.
SE003	Code Metal	Code Metal — Verifiable Code Translation	From Formal Methods to Compiler Design applying our specialties to help customers is what drives us.
SE004	Code Metal	Code Metal — Verifiable Code Translation	Senior/Principal Software Engineer Compiler & AI Tooling; Research Engineer - Formal Methods; Senior Platform DevOps Engineer Cloud + On-Prem.
SE005	Code Metal	Code Metal — Verifiable Code Translation	We implement reasonable security measures to protect your information; however no method of transmission over the internet or method of electronic storage is 100% secure and we cannot guarantee absolute security.
SE006	Code Metal	Code Metal - Verifiable Code Translation	Formal methods are mathematically rigorous techniques for proving that a program satisfies a specification.
SE007	Code Metal	Verified Code Transpilation with LLMs
SE008	Code Metal	Code Metal - Verifiable Code Translation	The amount of time taken by Tenspiler for migration is directly related to the complexity of the input as well as the output program.
SE009	Code Metal	Code Metal - Verifiable Code Translation	All of the generated kernels are validated to be correct.
SE010	Code Metal	Workflows vs Agents for Code Translation
SE011	Code Metal	UniPar: A Unified LLM-Based Framework for Parallel and Accelerated Code Translation in HPC
SE012	Code Metal	Counting Without Running: Evaluating LLMs' Reasoning About Code Complexity	Modern LLMs handle straightforward kernels well but struggle significantly with complex scenarios involving division math functions or shared subexpressions.
SE013	Code Metal	Code Metal — Verifiable Code Translation
SE014	Code Metal	Code Metal - Verifiable Code Translation	AI code generation has hit an inflection point: mission-critical industries cannot deploy what they cannot verify.
SE015	Code Metal	Code Metal - Verifiable Code Translation	Vibe coding helps software teams build MVPs fast. But it does not address our customers' needs: writing zero-error production code onto hardware.
SE016	Code Metal	Code Metal - Verifiable Code Translation	Code Metal will use this funding to continue building modular and verifiable agentic workflows that transform product development timelines from months to days.
SE017	Business Wire	Code Metal Secures $125M Series B at $1.25B Valuation to Bridge the Trust Gap in AI Code Generation
SE018	Salesforce Ventures	Verified Code for Mission-Critical Systems	Code Metal has built something fundamentally different: a neuro-symbolic platform that marries generative AI with formal verification.
SE019	B Capital	Translating Code When Failure is Not an Option: Why We Invested in Code Metal
SE020	DARPA	Automated Rapid Certification of Software
SE021	DARPA	High-Assurance Cyber Military Systems
SE022	NIST	AI Risk Management Framework
SE023	arXiv	Verified Code Transpilation with LLMs
SE024	arXiv	Tenspiler: A Verified Lifting-Based Compiler for Tensor Operations (Extended Version)
SE025	Metalift	Metalift · A program synthesis framework for verified lifting applications
SE026	UC Berkeley EECS	Alvin Cheung's Homepage
SE027	UC San Diego	Loris D'Antoni
SE028	Code Metal	Code Metal - Verifiable Code Translation
SE029	Code Metal	Code Metal - Verifiable Code Translation
SE030	Code Metal	Code Metal - Verifiable Code Translation
SE031	Code Metal	MONOCODER: Domain-Specific Code Language Model for HPC Codes and Tasks
SU001	Code Metal	Code Metal - Verifiable Code Translation	How some of our customers are using Code Metal.
SU002	Code Metal	Code Metal Product	Choose your edge environment configuration - pick a CPU ... GPUs ... FPGAs ... and preferred toolchains.
SU003	Code Metal	Code Metal Careers	Facility Security Officer ... Forward Deployed Engineer ... Principal Solutions Architect.
SU004	Code Metal	Code Metal Closes $125 Million Series B, Ryan Aytay Joins as COO	Customers, including Toshiba, RTX, L3Harris, and the U.S. Air Force, use Code Metal to move between programming languages and optimize software for hardware at unprecedented speed.
SU005	Business Wire	Code Metal Secures $125M Series B at $1.25B Valuation to Bridge the Trust Gap in AI Code Generation	Demand has pulled Code Metal into programs of record across the U.S. Air Force, L3Harris, and more.
SU006	Salesforce Ventures	Verified Code for Mission-Critical Systems	Code Metal has demonstrated exceptional velocity in its first year of commercialization, securing customers including L3Harris, Raytheon, and the U.S. Air Force.
SU007	B Capital	Translating Code When Failure is Not an Option: Why We Invested in Code Metal	Code Metal has built remarkable commercial momentum ... winning customers including the U.S. Air Force, L3Harris, Toshiba, and RTX.
SU008	CNBC	AI startup Code Metal is going beyond vibe coding with the help of $36 million in fresh capital	Code Metal said it is already on contract to deliver eight figures in revenue this year.
SU009	Wired	Code Metal Raises $125 Million to Rewrite the Defense Industry’s Code With AI	While some of the methodologies behind their technology remain unproven, investors are willing to gamble that at least a few will pan out.
SU010	Code Metal	Code Metal Raises $36.5 Million for Verifiable AI-Powered Code Translation	Shield Capital's additional commitments reflect ... delivering results, such as speeding code translation from weeks to days for L3Harris across several projects.
SU011	Code Metal	Code Metal Secures $16.5M in Seed Funding	Code Metal is already generating revenue and has established strategic partnerships with industry leaders in edge deployments, including X-Press Feeders ... and L3Harris.
SU012	U.S. Securities and Exchange Commission	EDGAR Search Results	D ... 2026-03-12 ... 2025-11-13 ... 2024-08-01 ... 2023-12-20.
SU013	Accel	Code Metal	AI developer tools for edge environments.
SU014	GeekWire	Tech Moves: Code.org has a new leader; Synapse vet joins Amazon; ex-Tableau CEO lands at Code Metal	The combination of world-class tech, real customer demand, and team building with urgency and integrity is rare.
SU015	arXiv	Verified Code Transpilation with LLMs	Lifting allows developers to port code to DSLs from which efficient code can be generated for special-purpose hardware, such as GPUs, machine learning accelerators, or network processors.
SU016	Code Metal	The Real Cost of Leaving NVIDIA: What Automated Transpilation Actually Costs	The Real Cost of Leaving NVIDIA: What Automated Transpilation Actually Costs.
SU017	DARPA	High-Assurance Cyber Military Systems (HACMS)	High-Assurance Cyber Military Systems (HACMS).
SU018	NIST	AI Risk Management Framework	AI Risk Management Framework.
SU019	AICOSoft	Code Metal Raises $125M for AI That Modernizes Defense Software	The goal isn't just to work on one-off projects. It's to create a reliable, repeatable process that can be used across the entire defense industry.
SU020	Give Me Technology	Code Metal Raises $125M to Modernize Defense Legacy Code	The company serves customers including Toshiba, RTX (formerly Raytheon Technologies), L3Harris Technologies, and the U.S. Air Force.
SU021	X	Andrej Karpathy (@karpathy) on X	It feels likely that we'll end up re-writing large fractions of all software ever written many times over.
SU022	Code Metal	Wired covers Code Metal	AI startup Code Metal is going beyond vibe coding with the help of $36 million in fresh capital.
SU023	Code Metal	TBPN interview	Code Metal announced the close of its $125 million Series B financing.
SU024	arXiv	Tenspiler: A Verified Lifting-Based Compiler for Tensor Operations (Extended Version)	Leveraging these languages requires developers to rewrite existing code using the specific DSL's API.
SU025	XpiryAI	Code Metal Raises $125M to Rewrite Defense Code With AI	This startup uses AI to translate old defense software into modern languages.
SU026	J2 Ventures	Companies	Meet the groundbreaking companies authoring the next chapters of paradigm-shifting innovation.
SU027	Shield Capital	Mission Matters Podcast	The Mission Matters podcast explores the intersection of technology, national security, and startups.
SU028	USAspending	Federal Awards \| Advanced Search \| USAspending	Start your search by adding filters.
SU029	SAM.gov	SAM.gov Search	SAM.gov \| Search
SR001	Code Metal	Homepage	Verifiable code translation for industries where every line of code matters.
SR002	Code Metal	Product
SR003	Code Metal	Careers
SR004	Code Metal	Privacy Policy	We collect various types of information in connection with the Service, including personal data, usage data, and cookies and tracking technologies.
SR005	Code Metal	About Us
SR006	Code Metal	AI-Generated Code That Works — and Proves It
SR007	Code Metal	Verified Code Transpilation with LLMs
SR008	Code Metal	Counting Without Running: Evaluating LLMs' Reasoning About Code Complexity	Modern LLMs handle straightforward kernels well but struggle significantly with complex scenarios involving division, math functions, or shared subexpressions.
SR009	Code Metal	Code Metal Raises $36.5 Million for Verifiable AI-Powered Code Translation
SR010	Code Metal	Code Metal Closes $125 Million Series B
SR011	Business Wire	Code Metal Secures $125M Series B at $1.25B Valuation to Bridge the Trust Gap in AI Code Generation
SR012	Salesforce Ventures	Verified Code for Mission-Critical Systems	Demand has already pulled Code Metal into programs of record across the Air Force, L3Harris, and more.
SR013	B Capital	Translating Code When Failure is Not an Option: Why We Invested in Code Metal
SR014	CNBC	AI startup Code Metal is going beyond vibe coding with the help of $36 million in fresh capital
SR015	WIRED	Code Metal Raises $125 Million to Rewrite the Defense Industry's Code With AI
SR016	GeekWire	Tech Moves: ex-Tableau CEO lands at Code Metal
SR017	U.S. Securities and Exchange Commission	EDGAR search results for Code Metal Form D filings
SR018	DARPA	ARCOS
SR019	DARPA	High-Assurance Cyber Military Systems (HACMS)
SR020	NIST	AI Risk Management Framework
SR021	CISA	Memory Safe Languages: Reducing Vulnerabilities in Modern Software Development
SR022	Department of Defense Chief Information Officer	Software Modernization Implementation Plan FY25–26
SR023	Department of Defense	DoDI 5000.87 Operation of the Software Acquisition Pathway
SR024	European Commission	Regulatory framework on artificial intelligence	The AI Act puts in place rules for providers of such models, including transparency and copyright-related rules.
SR025	U.S. Copyright Office	Report on Copyright and Artificial Intelligence	On May 9, 2025, the Office released a pre-publication version of Part 3 in response to congressional inquiries and expressions of interest from stakeholders.
SR026	arXiv	Verified Code Transpilation with LLMs	While large language models have shown some success in automatic code transpilation, none of them provide any functional correctness guarantees on the transpiled code.
SR027	arXiv	Tenspiler: A Verified Lifting-Based Compiler for Tensor Operations (Extended Version)
SR028	UC Berkeley	Alvin Cheung's Homepage	Code generated using our techniques are now deployed at Adobe and Google.
SR029	University of California San Diego	Loris D'Antoni	I'm a professor in the Programming Systems Group and a Scholar at Code Metal.
SR030	J2 Ventures	Companies
SR031	Shield Capital	Mission Matters Podcast
SR032	USAspending	Federal Awards \| Advanced Search \| USAspending
SR033	SAM.gov	SAM.gov Search
SR034	Code Metal	The Real Cost of Leaving NVIDIA: What Automated Transpilation Actually Costs
SR035	European Commission	Navigating the AI Act	The AI Act introduces a uniform framework across all EU Member States, based on a risk-based approach.
SR036	U.S. Copyright Office	Copyright and Artificial Intelligence, Part 2: Copyrightability Report	Copyright does not extend to purely AI-generated material, or material where there is insufficient human control over the expressive elements.
SR037	U.S. Government Accountability Office	Artificial Intelligence Acquisitions: Agencies Should Collect and Apply Lessons Learned to Improve Future Procurements	Agencies reported difficulty accessing AI technical experts to evaluate contractor proposals and said it was hard to understand AI-related costs.
SR038	U.S. Government Accountability Office	Artificial Intelligence: DOD Needs Department-Wide Guidance to Inform Acquisitions	Without department-wide and tailored service-level guidance, DOD is missing an opportunity to consistently acquire AI capabilities in a manner that accounts for the unique challenges associated with AI.
SR039	CISA	Secure by Design	Products designed with Secure by Design principles prioritize the security of customers as a core business requirement.
SR040	CISA	Software Must Be Secure by Design, and Artificial Intelligence Is No Exception	AI is a type of software system, and like any software system, AI must be Secure by Design.
SV001	Code Metal	Code Metal raises $36.5 million for verifiable AI-powered code translation	The company said it raised $36.5 million in Series A funding at a $250 million valuation.
SV002	CNBC	AI startup Code Metal is going beyond vibe coding with the help of $36 million in fresh capital
SV003	Code Metal	Code Metal closes $125 million Series B	Code Metal announced the close of its $125 million Series B financing led by Salesforce Ventures.
SV004	Business Wire	Code Metal Secures $125M Series B at $1.25B Valuation to Bridge the Trust Gap in AI Code Generation	Code Metal secures $125M Series B at $1.25B valuation.
SV005	WIRED	Code Metal Raises $125 Million to Rewrite the Defense Industry’s Code With AI	Methodologies in AI code tooling remain unproven, and investors are betting some picks-and-shovels vendors will work.
SV006	Salesforce Ventures	Verified Code for Mission-Critical Systems
SV007	B Capital	Translating Code When Failure is Not an Option: Why We Invested in Code Metal
SV008	U.S. Securities and Exchange Commission	EDGAR Search Results: Code Metal Form D filings
SV009	TechCrunch	https://techcrunch.com/2024/12/06/snyk-hits-300m-arr-but-isnt-rushing-to-go-public/	Snyk, the developer security startup most recently valued at $7.4 billion, hit $300 million ARR and is not rushing to go public.
SV010	CTech by Calcalist	https://www.calcalistech.com/ctechnews/article/684uz2na8	Snyk reported significantly slower revenue growth in 2024, generating $278 million last year, a 26% increase.
SV011	Forge	https://forgeglobal.com/insights/snyk-upcoming-ipo-news/	The company's valuation reached as high as $8.5 billion but has since come down to $7.4B.
SV012	SonarSource	https://www.sonarsource.com/company/press-releases/sonar-raises-412-million/	Sonar announced it raised $412 million at a valuation of $4.7 billion.
SV013	Business Wire	https://www.businesswire.com/news/home/20220426005213/en/SonarSource-the-Leading-Platform-for-Clean-Code-Raises-%24412-Million-in-New-Investment
SV014	SecurityWeek	https://www.securityweek.com/code-security-firm-sonarsource-raises-412-million-47-billion-valuation/
SV015	Diffblue	https://www.diffblue.com/resources/diffblue-secures-6-3-million-in-new-funding-amidst-3x-growth-period/	Diffblue secured $6.3 million in new funding amidst a 3x growth period.
SV016	Parkwalk Advisors	https://parkwalk.vc/article/diffblue-secures-6-3m-in-funding/	Diffblue announced it secured $6.3 million in new capital as it reached 326% net new ARR growth from the prior six months.
SV017	IBM	AI coding agent \| IBM	Agentic development environments are redefining how enterprises build and modernize software.
SV018	IBM	https://www.ibm.com/investor/annual-report
SV019	Booz Allen Hamilton	Artificial Intelligence	As the number one provider of AI solutions to the federal government, we help propel society toward positive outcomes.
SV020	SAIC	SAIC \| Data and Artificial Intelligence
SV021	SAIC	https://investors.saic.com/financials/sec-filings/default.aspx
SV022	GrammaTech	GrammaTech \| Software Assurance & Cyber-Security Solutions	30+ years of cyber innovation focused on security, resilience, automation, and developer productivity.
SV023	Galois	Galois - Home	Galois delivers high-assurance solutions and tools across aerospace & defense, semiconductors, and enterprise IT.
SV024	Code Metal	Code Metal Product
SV025	Code Metal	Verified Code Transpilation with LLMs
SV026	Grand View Research	Application Modernization Services Market Size Report, 2030
SV027	Mordor Intelligence	Application Security Market Size, Scope, Demand Report 2031
SV028	Diffblue	https://www.diffblue.com/about-us/
SV029	MassRobotics	Code Metal Raises $125 Million to Rewrite the Defense Industry’s Code With AI
SV030	Tech Funding News	Code Metal raises $125M: Can neuro-symbolic AI close the trust gap in mission-critical software?
SV031	Code Metal	Code Metal secures $16.5M in seed funding	Code Metal secured $16.5 million in seed funding after previously raising a $3.45 million pre-seed.

Cover facts

Company profile

Executive summary

Top strengths

Top risks

Open gaps

Contents

1.1 Identity, Product, and Operating Footprint

1.2 Founders, Leadership, and Governance Concentration

1.3 Funding History, Investor Map, and Strategic Overlap

1.4 Public Scale Signals, Milestones, and Evidence Gaps

2.1 Market boundary: high-assurance modernization, not generic copilots

2.2 Sizing with evidence-constrained lenses rather than one headline TAM

2.3 Buyer, user, and payer segmentation

2.4 Growth drivers, certification friction, and market limits

2.5 Exhibits

3.1 Landscape and Nearest Alternatives

3.2 Capability Breadth and Verification Posture

3.3 Deployment, Pricing, and Budget Owner

3.4 Switching Costs, Substitutes, and Moat Durability

3.5 Exhibits

4.1 Revenue model and pricing posture: contract visibility is real, but realized economics are not

4.2 Traction and GTM proxies: visible logos, strategic investors, and high-touch sales indicators

4.3 Cost structure and margin path: R&D-heavy today, with leverage only if delivery standardizes

4.4 Capital adequacy and financing dependency: Series B buys time, but not proof of repeatable revenue

4.5 Evidence gaps and financial verdict: analytically positive, still too opaque for clean underwriting

5.1 Product Workflow in Customer Terms

5.2 Architecture, Lineage, and Operating Model

5.3 Deployment, Integration, Support, and Maturity

5.4 Trust, Safety, Limits, and Diligence Priority

5.5 Exhibits

6.1 Customer map: public evidence points to four real buyer buckets, plus one still-unnamed semiconductor class

6.2 Named proof: the logos are real, but most public evidence still comes from company and investor channels

6.3 Durability: likely sticky if deployments land, but public retention evidence is still mostly absent

6.4 Reference risk: few logos, heavy defense mix, and thin third-party validation keep concentration risk high

6.5 Exhibits

7.1 Risk priority and adverse stance

7.2 Regulatory, legal, and provenance risk

7.3 Operational, partner, and execution risk

7.4 Financial risk, mitigations, and thesis-breaks

8.1 Financing context and public price support

8.2 Investment thesis and anti-thesis

8.3 Scenario ranges and comparable boundaries

8.4 Recommendation, entry discipline, and final asks

8.5 Exhibits

Disclaimer

Evidence index