ClickHouse
Open-source analytics leader with strong product pull, but late-stage pricing already discounts heavy execution.
ClickHouse has strong product-market pull, credible cloud monetization, and marquee customer adoption, but its private-market valuation already prices in substantial future execution despite limited public disclosure on revenue quality and margins.
Cover facts
Company profile
ClickHouse is the commercial company behind the open-source ClickHouse analytics database, combining a developer-led adoption engine with a managed cloud product for real-time analytics, observability, and emerging AI data workloads. The corporate entity was formed in 2021 around technology that originated inside Yandex and was open-sourced in 2016. Public evidence shows a San Francisco-centered company with a globally distributed engineering footprint, rapid customer growth, and a financing profile that expanded materially in 2025.
- Website
- clickhouse.com
- Founded
- 2021-08-25
- Founders
- Aaron Katz, Alexey Milovidov, Yury Izrailevsky
- Founding location
- Portola Valley, California, United States
- Headquarters
- San Francisco, California, United States
- Product
- ClickHouse sells an open-source column-oriented OLAP database plus ClickHouse Cloud, a fully managed service that handles scaling, operations, and infrastructure across AWS, Azure, and GCP.
- Customers
- Developers, data-platform teams, and enterprises running real-time analytics, observability, and AI-adjacent data workloads.
- Business model
- Open-source distribution feeding usage-based managed cloud revenue, with enterprise expansion through dedicated and BYOC deployments.
- Stage
- Series C private company
- Funding status
- Raised a $350M Series C in May 2025 at roughly a $6.35B valuation, taking disclosed funding above $650M.
Executive summary
Top strengths
- Strong open-source distribution and developer mindshare anchored by a widely adopted analytics engine and large contributor community.
- Credible cloud monetization with multi-cloud availability, usage-based pricing, and evidence of rapid ARR expansion.
- Referenceable enterprise and internet-scale customer base including Cloudflare and Contentsquare with concrete performance and cost outcomes.
Top risks
- Public disclosure still omits ARR quality, gross margin, retention, and cash-flow visibility needed to underwrite downside protection.
- The Series C price implies roughly mid-30s to low-40s ARR multiples, above many public and private comps.
- Open-source and bundled-warehouse competition from DuckDB, StarRocks, Snowflake, Databricks, and hyperscalers can compress pricing and expansion.
Open gaps
- Cohort-level ARR quality, retention, and gross margin data remain undisclosed.
- Customer concentration and top-account exposure are not visible from public sources.
- The economics of converting large open-source usage into durable paid cloud revenue remain unproven publicly.
Contents
01Company Overview
1.1 Identity, origin, and operating footprint
ClickHouse is easiest to understand as two linked entities: an analytics database project that began inside Yandex in 2009 and a commercial company incorporated in Delaware in 2021 to commercialize that project globally. The technical lineage matters because it explains why ClickHouse arrived on the market with unusual performance credibility: it was built for Yandex.Metrica-scale analytical workloads long before it became a venture-backed startup. The commercial company now describes the product as a fast, open-source, column-oriented database for real-time analytics, with monetization centered on ClickHouse Cloud and adjacent real-time analytics, observability, and AI/ML workloads. The footprint is similarly hybrid. Official history places headquarters in the San Francisco Bay Area, while current third-party profiles variously label the company as San Francisco, Portola Valley, or Palo Alto. Those labels are inconsistent at the street-address level but consistent on the larger point that ClickHouse is a Bay Area-headquartered company with a meaningful Amsterdam office and a deliberately distributed workforce. Official sources say employees are spread across more than 10 countries, and public profiles support a 500-plus employee scale by 2026. For later chapters, the most defensible shorthand is therefore: Bay Area headquarters, Amsterdam engineering and European hub, and globally distributed operations.[CO001, CO003, CO004, CO008, CO009, CO010]
| Metric | Value / Status | Date | Confidence | Gap / Note |
|---|---|---|---|---|
| Project start | 2009 inside Yandex | 2009 | high | Project origin, not company formation |
| Commercial company formation | Delaware corporation | Aug 2021 | high | Separate from 2009 project and 2012 production deployment |
| Open-source release | Apache 2.0 | 2016 | high | Foundation of developer adoption |
| HQ / office footprint | Bay Area HQ; Amsterdam office | current | medium | Portola Valley, Palo Alto, and San Francisco labels all appear in public sources |
| Headcount | 531-569 (500+) | Apr-May 2026 | medium | PitchBook and Tracxn differ but both support late-stage scale |
| Customers | >2,000 | 2025 | medium | Company disclosure; not independently audited |
| ARR / revenue markers | ~$160M ARR; ~$100M annualized revenue in H1 2025 | 2025 | low | ARR is third-party estimated and not audited |
| Series B | $250M at $2B valuation | Oct 2021 | high | Official press release plus investor databases corroborate |
| Series C | $350M at roughly $6.35B valuation | May 2025 | medium | Company release confirms raise; valuation comes from third-party coverage |
| Total funding after Series C | >$650M plus $100M credit facility | May 2025 | medium | Excludes later 2025/2026 activity for chapter consistency |
Rows reconcile project chronology, company formation, and 2025 scale markers; valuation and ARR remain third-party rather than audited company disclosures.
[CO001, CO004, CO010, CO014, CO015, CO016]Company logic links open-source origin, cloud monetization, capital, global team, and risk controls.
[CO004, CO008, CO009, CO011, CO015, CO016]Selected KPIs summarize maturity, traction, and continuity between open-source adoption and late-stage scale.
1.2 Founders, leadership, and governance
The founding team combines original technical authorship with enterprise go-to-market and product-scale experience. Alexey Milovidov created ClickHouse inside Yandex and remains the technical anchor as CTO. Aaron Katz, a former Salesforce and Elastic executive, brought the commercialization playbook and serves as CEO. Yury Izrailevsky, whose prior roles included senior engineering leadership at Netflix and Google, serves as president and provides the product and engineering-scale bridge between a beloved open-source project and a global software company. Governance is more investor-shaped than publicly transparent. Index's long-form origin story makes clear that Mike Volpi of Index Ventures and Peter Fenton of Benchmark were not passive capital providers; they worked with Katz on the spin-out structure that created an independent, majority-controlled Delaware company. Separate current-profile sources show both Volpi and Fenton serving on ClickHouse's board, which gives the company experienced infrastructure-software oversight even though the broader board and observer slate is not fully public. Leadership depth also improved in 2025 when ClickHouse added Kevin Egan as CRO, Mariah Nagy as VP People, and Jimmy Sexton as CFO. That trio matters because it signals a transition from founder-led formation into late-stage operating discipline across revenue, talent, and finance.[CO005, CO006, CO007, CO021, CO022, CO033]
| Person | Role | Background | Coverage / Relevance | Key-Person Dependency |
|---|---|---|---|---|
| Aaron Katz | Co-founder & CEO | Ex-Salesforce and Elastic operator | Commercialization, fundraising, and customer narrative | High |
| Alexey Milovidov | Co-founder & CTO | Creator of ClickHouse inside Yandex | Core architecture and technical credibility | High |
| Yury Izrailevsky | Co-founder & President | Former engineering leader at Netflix and Google | Product and engineering scale-up | High |
| Kevin Egan | Chief Revenue Officer | Former Atlassian, Slack, Dropbox, Salesforce leader | Enterprise sales expansion in 2025 | Medium |
| Mariah Nagy | Vice President of People | Former Weights & Biases, Confluent, SurveyMonkey | Distributed-team talent systems | Low |
| Jimmy Sexton | Chief Financial Officer | Former Snowflake and ServiceNow finance leader | Late-stage finance discipline | Medium |
| Mike Volpi | Board member / Index Ventures | Retired Index partner; long-time open-source investor | Investor-governance continuity from formation | Medium |
| Peter Fenton | Board member / Benchmark | Benchmark GP with major open-source and infrastructure wins | Investor-governance continuity from formation | Medium |
This is a partial but decision-relevant enumeration of founders, 2025 executive additions, and publicly visible investor-directors; the full org chart is not public.
[CO005, CO006, CO007, CO021, CO022, CO033]1.3 Capital base, investor continuity, and commercial scale
ClickHouse's funding history shows an unusually compressed venture trajectory. Index Ventures and Benchmark led the $50 million Series A in August 2021 as the spin-out formed. Barely two months later, the company raised a $250 million Series B at a $2 billion valuation, led by Coatue and Altimeter with participation from Benchmark, Lightspeed, Almaz, and other growth investors. That 2021 financing burst funded the cloud buildout and moved ClickHouse from an admired open-source project into a company with enough capital to scale globally. The next major re-rating arrived in May 2025. Company and investor releases confirm a $350 million Series C led by Khosla Ventures, with new investors BOND, IVP, Battery Ventures, and Bessemer joining a follow-on set that included Benchmark, Coatue, Lightspeed, FirstMark, GIC, and Nebius. Goodwin and company releases also confirm a parallel $100 million credit facility led by Stifel and Goldman Sachs. Third-party coverage places the round at roughly a $6.35 billion to $6.4 billion valuation and total funding above $650 million. Public operating signals fit a late-stage growth profile: 2,000-plus customers by 2025, more than 300% annual growth according to the company, public revenue/ARR markers around $100 million annualized in mid-2025 and roughly $160 million ARR by end-2025, and employee counts in the low-to-mid 500s by 2026.[CO017, CO018, CO019, CO020, CO023, CO024]
| Stakeholder | Round(s) / Instrument | Role | Importance / Control | Diligence Ask |
|---|---|---|---|---|
| Index Ventures | Series A; follow-on in later rounds | Founding institutional backer | Formation sponsor; Mike Volpi on board | Confirm current ownership and any pro-rata rights |
| Benchmark | Series A, Series B, Series C | Founding and continuing VC backer | Peter Fenton board presence adds governance weight | Confirm ownership, board committee roles, and reserves |
| Coatue | Series B lead; Series C follow-on | Growth investor | Validated 2021 re-rating and stayed through 2025 | Confirm current position size and any secondary activity |
| Altimeter | Series B co-lead | 2021 growth investor | Helped establish the $2B valuation step-up | Clarify whether it remained active post-2021 |
| Lightspeed | Series B; visible continuing investor | Growth VC and portfolio sponsor | Public portfolio page ties the firm directly to 2021 commercial scaling | Confirm ownership and board observer rights |
| Almaz Capital | Series B participant | Early growth investor | Shows cross-border investor mix in 2021 round | Confirm whether stake was maintained after Series C |
| Khosla Ventures | Series C lead | 2025 lead investor | Led the company's major AI-era valuation reset | Review terms, preferences, and governance rights |
| Stifel / Goldman Sachs | May 2025 credit facility | Debt providers | Introduce leverage and covenant considerations into the cap stack | Review covenants, draw conditions, and liens |
| Nebius / Yandex legacy | 2021 contribution; later warrants | Residual historical stakeholder | Important for geopolitics and cap-table interpretation despite no reported equity in 2025 article | Confirm warrant mechanics and expiry terms |
Map covers the investors and lenders explicitly visible in retained public sources; it is not a full cap table and should not be read as exhaustive ownership disclosure.
[CO017, CO018, CO019, CO020, CO023, CO024]1.4 Milestones, geopolitical context, and disclosed risk
The milestone record is strong enough to establish reusable ground truth for the rest of the report. The technical origin started in 2009, production use came in 2012, and open-source release happened in 2016. Commercial formation came in 2021 with the spin-out and back-to-back Series A and Series B financings. The next major operating phase arrived in 2022, when ClickHouse opened Amsterdam offices, launched cloud early access, and publicly explained how it was relocating engineering talent out of Russia after the invasion of Ukraine. Those moves are not just historical footnotes; they are part of the governance and customer-trust story that made later enterprise growth possible. The key adverse lens is therefore not a classic demand problem but a trust-and-execution one. ClickHouse carried Yandex and Russia-origin perception risk into Western enterprise sales, and it also accumulated publicly disclosed security vulnerabilities, including multiple memory-safety issues and a 2024 query-cache access-control bug. The company's mitigation case is credible but not costless: it separated legal domicile from Russia, relocated engineers to Amsterdam, emphasized Western investors and directors, and kept shipping product milestones such as the HyperDX acquisition, OpenHouse launch, and 2025 Series C. For diligence, that means the core question is no longer whether ClickHouse has escaped its origin story, but whether governance transparency, security process, and late-stage operating controls have kept pace with the speed of commercial scaling.[CO002, CO010, CO012, CO028, CO029, CO032]
| Date | Event | Type | Amount / Status | Participants | Implication |
|---|---|---|---|---|---|
| 2009 | Experimental analytical database project starts inside Yandex | founding | Alexey Milovidov and Yandex team | Technical origin of ClickHouse | |
| 2012 | ClickHouse enters production for Yandex.Metrica | product | Yandex | Demonstrates real-world scale before company formation | |
| 2016 | Open-source release under Apache 2.0 | product | ClickHouse / Yandex | Starts external developer adoption | |
| Aug 2021 | ClickHouse, Inc. incorporates and raises Series A | founding | $50M | Aaron Katz, Alexey Milovidov, Yury Izrailevsky, Index, Benchmark | Creates independent venture-backed company |
| Oct 2021 | Series B closes at $2B valuation | financing | $250M / $2B | Coatue, Altimeter, Benchmark, Lightspeed, Almaz and others | Funds cloud and global go-to-market buildout |
| Mar 2022 | Company publishes Ukraine statement and relocation clarification | adverse | Relocation accelerated | ClickHouse | Addresses Yandex/Russia perception risk |
| 2022 | Amsterdam office opens and ClickHouse Cloud enters early access | scale | Live | ClickHouse | Establishes European hub and commercial cloud phase |
| Mar 2025 | HyperDX acquisition closes | product | Acquisition | ClickHouse, HyperDX | Expands observability footprint |
| May 2025 | OpenHouse launches in San Francisco and Series C is announced | financing | $350M / ~$6.35B | Khosla Ventures and broad syndicate | Major valuation reset and market signal |
| May 2025 | $100M credit facility announced | financing | $100M | Stifel, Goldman Sachs | Adds non-dilutive capital and debt complexity |
| Oct 2025 | Series C extension and three senior executive hires disclosed | governance | Extension + hires | Citi Ventures, Insight, Peak XV; Egan, Nagy, Sexton | Deepens bench and extends financing runway |
| Apr-May 2026 | Profiles show 531-569 employees | scale | 500+ employees | PitchBook, Tracxn | Confirms late-stage operating scale |
Chronology prioritizes the company-formation lens: 2009 and 2012 refer to project milestones, while 2021 marks the legal startup formation. 2025 valuation uses third-party coverage rather than company-issued pricing.
[CO001, CO002, CO003, CO004, CO017, CO018]Selected milestones show the path from Yandex-origin project to late-stage venture-backed analytics platform.
02Market Analysis
2.1 Market boundary, included spend, and status-quo substitutes
ClickHouse should be analyzed as analytical data infrastructure rather than as a generic database. Official ClickHouse materials consistently position the product as a fast, column-oriented OLAP database used for real-time analytics, observability, data warehousing, and ML/GenAI workloads. That framing matters because the relevant spend is not “all databases,” and it is not “all business intelligence software” either. The included budget pool is the infrastructure layer where teams need high-ingest, low-latency SQL analytics on large event, telemetry, or warehouse datasets. Inside the boundary are four primary spend buckets. First is cloud data warehousing and BI acceleration, where analytics engineering teams replace or complement slower warehouse layers with a faster analytical store. Second is real-time product and event analytics, where product or data platform teams ingest streaming events and serve dashboards or user-facing analytical applications. Third is observability and log analytics, where SRE, platform, and security teams store and query logs, metrics, traces, and high-cardinality OpenTelemetry data. Fourth is AI-adjacent analytics infrastructure, where teams want vector-aware retrieval, fast aggregations, and operational analytics around AI systems. The market boundary explicitly excludes OLTP systems of record, front-end BI tools as a standalone category, and generalized data lake storage that does not itself deliver low-latency SQL serving. The status-quo substitute set is strong and segmented: BigQuery is the default serverless cloud data warehouse for many Google Cloud buyers; Datadog, Elastic, and AWS OpenSearch bundle logs, metrics, traces, search, and increasingly AI or vector workflows inside their own managed platforms; and self-managed open-source infrastructure remains the control-oriented alternative for teams that prefer to avoid managed service lock- in. ClickHouse’s advantage is that it can sit across these use cases with one core engine, but the boundary logic only works if the chapter treats it as overlapping analytical spend rather than one monolithic “database market.”[CM001, CM002, CM003, CM004, CM017, CM032]
| Segment / category | Included spend | Excluded spend | Buyer / payer | Relevance to ClickHouse |
|---|---|---|---|---|
| Cloud data warehouse / BI acceleration | Managed and self-hosted analytical storage and query infrastructure for dashboards and business analytics | Front-end BI licenses and generic ETL-only tools | Analytics engineering or data platform team / centralized data budget | ClickHouse data warehousing page explicitly positions the product as a real-time warehouse that improves BI speed and concurrency |
| Real-time product and event analytics | Event ingestion query serving and low-latency SQL analytics for apps dashboards and operational reporting | OLTP systems of record and stream processing software sold without analytical serving | Product engineering or data engineering / VP Engineering or platform budget | ClickHouse official site and use-case pages center on sub-second analytical queries over continuously ingested data |
| Observability and log analytics | Logs metrics traces high-cardinality OpenTelemetry data retention and querying | Ticketing or incident-management SaaS without analytical storage | SRE platform engineering or security operations / infra or observability budget | ClickStack puts ClickHouse directly into the OTel observability storage and query layer |
| AI and model-adjacent analytics | Vector-aware retrieval analytics agent telemetry and analytical context around AI systems | Core model training or inference spend without analytical storage | AI platform team / ML infrastructure or innovation budget | ClickHouse homepage and adjacent competitor pages show the market moving toward AI-powered analytics and observability |
| Self-managed analytical infrastructure | Customer-operated clusters on local hardware on-prem or cloud VMs across AWS GCP and Azure | Managed SaaS control planes when sovereignty or full operational control is not required | Platform engineering / infra budget | Deployment flexibility is a core differentiator because ClickHouse can serve buyers who reject fully managed-only products |
| Status-quo substitutes | BigQuery Datadog Elastic AWS OpenSearch and internal self-managed open-source stacks | Pure productivity software categories with no analytical data plane | Mixed by workload and incumbent stack owner | These are the platforms ClickHouse most often displaces or complements depending on the use case |
The relevant boundary is analytical infrastructure workload spend, not every database dollar. ClickHouse is best understood as a reusable engine across warehouse, event analytics, and observability workloads.
[CM001, CM002, CM003, CM013, CM032, CM034]2.2 TAM floor, adjacent market lenses, and overlap discipline
The cleanest conservative TAM floor for ClickHouse is the cloud data warehouse category. Mordor Intelligence values that market at $14.94 billion in 2026, while Research and Markets values it at $14.53 billion in 2026; both imply a current addressable category already above $10 billion before adding any observability or real-time analytics adjacency. IndustryARC is materially more bullish at $39.1 billion by 2026, which is directionally useful but should be treated as a high-end estimate because it likely sweeps in a broader set of DWaaS and data storage workloads than ClickHouse captures directly. A broader upper-bound lens comes from streaming analytics. Grand View Research sizes that market at $23.4 billion in 2023 and $128.4 billion by 2030 at a 28.3% CAGR, with hosted deployment already the majority of revenue. ClickHouse benefits from the same demand shift toward real-time insight, but this is not a clean ClickHouse TAM because the streaming analytics category includes stream-processing software, services, and adjacent tooling above the database layer. Observability provides the third market lens. Grand View Research, MarketsandMarkets, and Mordor Intelligence all place the current observability market in the multi-billion-dollar range, with cloud deployment and large enterprises leading adoption. This is directly relevant because ClickStack positions ClickHouse as the storage and query layer for logs, metrics, traces, and high-cardinality OpenTelemetry data. The overlap problem is material: warehouse, real-time analytics, and observability budgets are not additive because the same buyer may use one platform for more than one workload. The correct conclusion is not to sum them, but to note that the conservative current TAM floor already exceeds $10 billion and that no public source cleanly isolates the narrower “real-time columnar OLAP database” segment that would be closer to ClickHouse’s true SAM.[CM019, CM020, CM021, CM022, CM023, CM024]
| Publisher | Year | Geography | Value | CAGR | Methodology | Confidence | Limitation |
|---|---|---|---|---|---|---|---|
| Mordor Intelligence | 2026-2031 | Global | $14.94B in 2026 -> $49.12B by 2031 | 26.86% | Cloud data warehouse market sizing with vendor set led by AWS Google Microsoft Snowflake Oracle | medium | Broad adjacent category; not a pure ClickHouse-specific SAM |
| Research and Markets | 2026-2030 | Global | $14.53B in 2026 -> $31.7B by 2030 | 21.5% | Cloud data warehouse market report with explicit trend stack including AI compute-storage separation and real-time data processing | medium | Still broad DWaaS category rather than a columnar-OLAP-only layer |
| MarketsandMarkets | 2026 | Global | Segmented cloud data warehouse market by application vertical deployment model and type | n/a | Market taxonomy corroboration that warehouse demand is split across customer analytics deployment and org size | low | Free fetched text did not surface the headline market number cleanly |
| IndustryARC | 2021-2026 | Global | $39.1B by 2026 | 31.4% | High-end cloud data warehouse forecast emphasizing IoT OLAP MPP and DBaaS demand | low | Likely the broadest estimate in the set and may overstate ClickHouse-relevant spend |
| Grand View Research | 2023 base; 2030 forecast | Global | $23.4B in 2023 -> $128.4B by 2030 | 28.3% | Streaming analytics market including software services deployment and end-use segmentation | medium | Useful upper-bound adjacency rather than a clean ClickHouse TAM |
| Grand View Research | 2023 base; 2030 forecast | Global | $2.71B in 2023 -> $5.40B by 2030 | 10.7% | Observability tools and platforms market | medium | Narrower than ClickHouse’s full scope and centered on observability only |
| MarketsandMarkets | 2023-2028 | Global | $2.4B in 2023 -> $4.1B by 2028 | 11.7% | Observability tools and platforms market with recession and remote-access framing | medium | Smaller time window and vendor-defined category boundary |
| Mordor Intelligence | 2026-2031 | Global | $3.35B in 2026 -> $6.93B by 2031 | 15.62% | Observability market with explicit 2026 baseline | medium | Relevant to ClickStack only; not inclusive of warehouse or event analytics workloads |
The table is intentionally multi-lens. The cloud data warehouse rows provide the conservative current TAM floor; streaming analytics provides the broadest real-time adjacency; observability sizes the telemetry-specific wedge where ClickHouse is increasingly active.
[CM019, CM020, CM021, CM022, CM023, CM024]Layered view from broad streaming analytics TAM to the conservative cloud data warehouse floor that is already relevant to ClickHouse.
[CM001, CM002, CM003, CM019, CM020, CM025]Source-backed low/high bands for the major adjacent categories relevant to ClickHouse.
The cloud data warehouse row uses the two explicit 2026 point estimates fetched from Mordor Intelligence and Research and Markets. The observability row brackets Grand View Research's 2024 current-scale point and Mordor's 2026 baseline. The streaming row is a tight band around Grand View Research's 2023 market size. The final row preserves IndustryARC's more aggressive 2026 warehouse estimate as a separate high-end lens rather than blending it into the base band.
[CM019, CM020, CM023, CM025, CM028, CM031]2.3 Buyer, user, payer, and deployment-path segmentation
ClickHouse’s buyer map splits by workflow rather than by industry slogan. The first buyer is the data platform or product analytics team that needs real-time event analytics for dashboards, user-facing applications, or operational reporting. Here the user is usually a data engineer, backend engineer, or analytics engineer; the budget owner is a VP of Engineering or head of data platform; and the adoption trigger is performance pain in an incumbent warehouse or the need to serve interactive analytics on continuously ingesting event streams. The second buyer is the BI or analytics engineering organization modernizing warehouse and customer analytics workloads. ClickHouse’s official warehousing page explicitly positions the product as a real-time data warehouse that improves query speed and concurrency at lower cost. In this segment, the payer is often a centralized data platform or IT budget, and the comparison set includes BigQuery and other managed warehouse platforms. The third buyer is the SRE, observability, or platform engineering team dealing with log, metric, and trace volume. ClickStack’s positioning around OpenTelemetry, sub-second queries, and high-cardinality telemetry is aimed directly at these teams, while competitor pages from Datadog, Elastic, and AWS OpenSearch show the same buyer already shopping unified, managed observability platforms. For these buyers, the payer is usually an infrastructure, platform, or security operations budget, and the key trigger is cost or performance pain at current log retention and query volumes. A fourth, earlier-stage segment is AI/ML platform teams that want fast analytical storage, vector-aware retrieval, and observability around AI systems. The adoption path usually starts with a self-hosted proof of concept or existing open-source use, then moves toward ClickHouse Cloud or managed ClickStack when the organization wants autoscaling, simpler upgrades, and lower operations burden. That dual cloud/self-managed path is strategically important because some buyers are explicitly optimizing for control and sovereignty while others are optimizing for faster time to production.[CM005, CM006, CM007, CM008, CM009, CM010]
| Segment | Buyer | User | Payer | Workflow | Budget owner | Adoption trigger |
|---|---|---|---|---|---|---|
| Real-time product and event analytics | VP Engineering or head of data platform | Data engineer backend engineer analytics engineer | Product or platform budget | Ingest event streams and serve sub-second dashboards or user-facing analytics | VP Engineering or platform lead | Warehouse latency limits or need for interactive analytics on live data |
| BI and warehouse modernization | Head of analytics engineering or director of data | Analytics engineer BI engineer data architect | Central data platform or IT budget | Replace or complement slower warehouse layers for better concurrency and lower cost | Chief data officer or data platform owner | Loading spinners high query latency or escalating warehouse spend |
| Observability and OTel data | SRE lead platform engineering manager or SecOps lead | SRE observability engineer platform engineer | Infrastructure observability or security operations budget | Store and query logs metrics traces and high-cardinality telemetry | VP Infrastructure or head of SRE | Rising log retention cost or poor query performance on incumbent observability stack |
| AI and model-adjacent analytics | ML platform lead or CTO | ML engineer data engineer platform engineer | ML infrastructure or innovation budget | Add vector-aware retrieval analytical context and telemetry around AI systems | CTO or head of AI platform | Need to unify AI telemetry and analytical context with operational data |
| Control-sensitive self-hosted deployments | Platform architect or compliance-conscious infrastructure leader | Platform engineer database engineer | Infrastructure budget | Run ClickHouse directly on-prem or on AWS GCP Azure instead of using a fully managed SaaS-only platform | Platform or infrastructure lead | Sovereignty compliance or desire to avoid managed-service lock-in |
Buyers are segmented by workflow and operational need, not by industry slogan. One company can contain more than one ClickHouse buyer if warehouse and observability budgets sit in different organizations.
[CM005, CM006, CM007, CM008, CM010, CM011]ClickHouse serves multiple analytical infrastructure buyers through one shared engine.
[CM002, CM003, CM006, CM008, CM009, CM010]2.4 Growth drivers, adoption constraints, and timing implications
Three demand vectors matter most for ClickHouse through the current cycle. First is the structural rise of real-time analytics: Grand View Research attributes streaming analytics growth to real-time forecasting, digitalization, and broader adoption of big data, IoT, and AI. ClickHouse’s official real-time analytics page maps directly to that workload, emphasizing continuous ingest, high query concurrency, and interactive SQL at scale. Second is cloud data warehouse modernization. Research and Markets highlights scalable storage/compute separation, real-time data processing, and predictive/operational analytics as top cloud warehouse trends; those themes match ClickHouse Cloud’s own storage-compute separation and autoscaling pitch. Third is observability data growth. Grand View, Grafana, IBM, and Elastic all point to the same market direction: cloud-native complexity, OpenTelemetry standardization, and AI-driven observability workflows are increasing the value of fast, efficient telemetry storage. The adoption constraints are equally important. Incumbents now sell integrated experiences, not just raw storage engines. BigQuery combines enterprise data warehouse, real-time analytics, and AI. Datadog and Elastic combine logs, metrics, traces, and AI-assisted investigation. AWS OpenSearch combines search, observability, serverless deployment, and vector workflows. These products create switching costs because the buyer is often comparing whole operating systems, pricing models, and governance frameworks rather than benchmark numbers alone. Cost discipline is also two-sided. ClickHouse markets lower infrastructure cost, less replica overhead, and aggressive compression, while Grafana and IBM both argue that observability in 2026 will shift from “collect everything” to higher-value telemetry and cost management. That means ClickHouse benefits when buyers want a more efficient storage/query engine, but it can lose when a team prefers to reduce data collection volume or stay inside an incumbent’s bundled platform rather than introduce another analytical layer. Timing-wise, the strongest near-term tailwinds are AI-infused analytics, OTel-native observability, and replacement of slower warehouse or log stacks; the main constraints are migration effort, incumbent platform bundling, and the need to sell across several budget owners instead of one clean software line-item.[CM009, CM010, CM013, CM014, CM015, CM021]
| Driver / constraint | Direction | Timing | Implication | Diligence ask |
|---|---|---|---|---|
| Real-time forecasting digitalization IoT and AI | Tailwind | Current and multi-year | Expands the volume of time-sensitive analytical workloads that favor columnar OLAP infrastructure | Measure what share of new ClickHouse workloads are event analytics versus traditional BI |
| Cloud warehouse modernization | Tailwind | Current | Buyers want compute-storage separation real-time processing and predictive analytics without warehouse latency pain | Ask how often ClickHouse enters as replacement versus acceleration layer alongside incumbent warehouses |
| OpenTelemetry-first observability | Tailwind | Current and strengthening into 2026 | Standardized telemetry and high-cardinality data increase demand for efficient log metric and trace storage | Quantify revenue and customer count from ClickStack or other observability-flavored deployments |
| AI-enabled analytics and AI observability | Tailwind | Emerging but immediate | AI agents analytics copilots and model telemetry create new analytical storage and retrieval demands | Ask what percentage of pipeline mentions AI or model telemetry as a primary buying trigger |
| Tool consolidation around unified observability | Mixed tailwind | Current | ClickHouse can benefit as a lower-cost core engine but may lose if buyers prefer staying inside incumbent suites | Request win-loss analysis where consolidation programs helped versus hurt ClickHouse |
| Integrated incumbent platforms | Constraint | Current and structural | BigQuery Datadog Elastic and AWS OpenSearch bundle adjacent workflows and reduce appetite for an additional platform | Get win rates by incumbent and by workload to identify where ClickHouse actually displaces bundled alternatives |
| Migration and operational change cost | Constraint | Current | Teams still need to move schemas queries data pipelines or telemetry workflows even if ClickHouse benchmarks better | Ask for median deployment time and professional-services burden by segment |
| Telemetry cost discipline and data-value filtering | Constraint | Current | Some buyers will reduce data volumes or retention instead of changing storage engines | Request retention-tier usage patterns and cost-out stories that prove ClickHouse wins after data-value optimization |
Tailwinds are strongest where workload growth and cost pressure happen together. Constraints rise when incumbent platforms can bundle enough adjacent functionality to make a switch feel operationally risky.
[CM009, CM010, CM013, CM014, CM021, CM026]Typical path from incumbent pain to self-managed proof of concept and then managed or scaled production deployment.
[CM003, CM005, CM006, CM007, CM008, CM009]2.5 Sizing gaps, contradictory estimates, and diligence asks
The biggest analytical gap is category overlap. Cloud data warehouse, streaming analytics, and observability are all valid market lenses for ClickHouse, but none is a perfect proxy for the narrower layer that ClickHouse actually monetizes. Summing the categories would overstate TAM because one platform can serve multiple workloads for the same customer, while using only one category understates the company’s actual scope. The chapter therefore preserves the estimates as lenses instead of forcing a synthetic SAM number. A second gap is the lack of public segmentation for ClickHouse itself. The available fetched evidence supports strong community scale, broad use-case coverage, and cross-vertical customer adoption, but it does not disclose what share of ClickHouse revenue comes from observability versus warehouse workloads, what percentage of customers choose cloud versus self-managed, or how AWS, GCP, and Azure mix differs by region and buyer type. Without those disclosures, public SOM estimation would be performative rather than analytical. The most useful diligence asks are therefore internal: revenue mix by workload, cloud service count by provider and segment, net retention by deployment model, median telemetry or event volume by customer cohort, and win-loss data against BigQuery, Datadog, Elastic, and AWS OpenSearch. Those answers would convert a broad, clearly large TAM story into a tighter and more investable SAM/SOM picture.[CM017, CM022, CM030, CM041, CM042, CM043]
2.6 Exhibits
03Competitors
3.1 Competitive landscape and market boundary
ClickHouse competes in a broader field than the label “data warehouse” implies. The company incorporated in 2021 around an open-source project that was already well known in analytical databases, and its commercial pitch now spans real-time analytics, data warehousing, observability, and AI-linked serving workloads. That breadth pulls in two kinds of direct rivals. First are broad analytical platforms such as Snowflake and Databricks, which can capture the same strategic budget when a buyer wants one governed system for engineering, analytics, and AI. Second are incumbent hyperscaler services such as BigQuery, Redshift, and Athena, which can solve enough of the same job while benefiting from existing cloud procurement relationships. The substitute set is different again: DuckDB, Apache Druid, StarRocks, and SingleStore each cover narrower slices of the same analytical problem, especially embedded analytics, streaming-first analytics, or low-latency serving. The practical status quo is therefore not one alternative but a mix of broader suites, bundled cloud services, and open-source or self-managed point solutions.[CP002, CP010, CP018, CP021, CP023, CP025]
| Competitor | Category | Scale / funding | Target segment | Deployment / open-source posture | Best-fit workloads | Pricing / positioning |
|---|---|---|---|---|---|---|
| ClickHouse | Reference platform | Commercial company incorporated in 2021; $350M Series C in May 2025; 2,000+ customers | Engineering-led teams building user-facing analytics, observability, and fast warehouse workloads | Open-source core, self-managed server plus managed cloud on major marketplaces | Real-time analytics, observability, data warehousing, AI-adjacent analytical serving | Usage-based cloud pricing with separate compute and storage; public philosophy clearer than realized enterprise net price |
| Snowflake | Direct incumbent | $9.77B RPO; 790 Forbes Global 2000 customers; 733 $1M+ customers | Enterprise analytics, governed data sharing, cross-cloud SQL and AI buyers | Managed multi-cloud service; proprietary platform with strong governance controls | SQL analytics, data sharing, governed AI-data platform | Explicit compute, storage, and data-transfer pricing with warehouse credits and per-second billing |
| Databricks | Direct broad-platform rival | 20,000+ organizations; 70% of Fortune 500; 1,200+ partners | Enterprise data engineering, lakehouse, governance, and AI teams | Commercial lakehouse platform with open-format posture rather than an open-source core | Data engineering, lakehouse, governance, analytics, and AI workflows | Public list-price and SKU groups exist, but comparison is less simple than clear starting tariffs |
| BigQuery | Hyperscaler incumbent | Backed by Google Cloud distribution and free-tier funnel; public-company parent with disclosed investor reporting | GCP-centric analytics, BI, and AI teams | Managed Google Cloud service; proprietary but highly serverless | Serverless warehouse, AI analytics, and Google-native data applications | Compute plus storage pricing with free tier and slot reservations |
| Redshift / Athena | Hyperscaler warehouse substitute | Backed by AWS distribution and annual-report-scale parent | AWS-native data teams standardizing on S3, SageMaker, and zero-ETL paths | Managed AWS services rather than open-source products | Warehouse, lakehouse, ad hoc SQL, and serverless analytics over S3 | Redshift and Athena both publish concrete entry pricing that lowers pilot friction |
| DuckDB | Embedded substitute | Open-source foundation project; no evidence of a large enterprise field-sales motion in reviewed sources | Developers and analysts doing local, notebook, application, or embedded analytics | MIT-licensed embedded database with no server process | Local analytics, embedded analytics, single-node analytical processing | Free open-source software rather than a public enterprise SaaS tariff |
| StarRocks | Adjacent real-time challenger | Smaller independent vendor in reviewed public materials; enterprise-scale analytics pitch but little public scale disclosure | Teams wanting low-latency SQL over fresh lakehouse or real-time data | Open-source-flavored analytical database with cloud ambitions but limited public commercial detail | Real-time analytics, lakehouse querying, AI-oriented SQL serving | Public commercial pricing is less transparent than major hyperscaler or DBaaS rivals |
| Apache Druid / Imply | Streaming-first substitute | Apache project plus commercial Imply distribution; Polaris public starter and standard tiers | Streaming-heavy analytics, ad-tech, telemetry, and customer-facing real-time dashboards | Open-source Druid core with commercial cloud and enterprise wrappers from Imply | Streaming-first real-time analytics and high-concurrency querying | Open-source core plus DBaaS tiers starting at $100/month and $600/month |
| SingleStore | Adjacent HTAP challenger | Private distributed-SQL vendor with major enterprise logos but limited public scale metrics | Teams combining transactional, analytical, and application-serving workloads | Cloud DBaaS plus self-managed deployment across VMs, cloud hosts, Docker, and Kubernetes | Real-time applications, HTAP-style workloads, and RAG-ready operational analytics | Usage-based credit pricing with storage charges and commitment options |
Scale and funding detail is public where disclosed; for private vendors with limited transparency, the row uses qualitative scale and route-to-market signals rather than invented revenue figures.
[CP006, CP007, CP010, CP014, CP018, CP020]Ordinal positioning of ClickHouse and retained alternatives on sovereign deployment flexibility versus breadth plus distribution power.
Axes are analyst-derived ordinal scores synthesized from public product and deployment materials; they are not audited market-share measurements.
[CP003, CP004, CP014, CP018, CP021, CP025]3.2 Profiles, workload fit, and open-source posture
ClickHouse is strongest when an engineering-led buyer wants high-concurrency analytical serving, fast SQL over large datasets, or one engine that can support observability and real-time product analytics without forcing a broad suite commitment. Snowflake and Databricks are the closest broad-platform competitors, but for different reasons. Snowflake brings the most mature public scale and an AI Data Cloud story with explicit governance and warehouse economics, while Databricks brings a wider open-format lakehouse narrative plus much larger partner and customer reach. BigQuery, Redshift, and Athena matter less because they look identical product-for-product and more because they make “good enough” analytics purchasable inside Google Cloud or AWS. DuckDB and Druid are more specialized substitutes: DuckDB is ideal for embedded and local analytics, while Druid is built for streaming-heavy real-time use. StarRocks and SingleStore sit in the middle, overlapping with ClickHouse on low-latency analytical serving while leaning harder into lakehouse and HTAP-style positioning. The open-source story matters here: ClickHouse, DuckDB, and Druid clearly retain project-level credibility, whereas Databricks emphasizes open formats and the large incumbents emphasize managed services.[CP001, CP011, CP014, CP015, CP017, CP018]
| Buying criterion | ClickHouse | Snowflake | Databricks | BigQuery | Redshift / Athena | DuckDB | StarRocks / Druid | SingleStore |
|---|---|---|---|---|---|---|---|---|
| Real-time analytical serving | Strong | Medium | Medium | Medium | Medium | Weak | Strong | Strong |
| Broad governed data + AI suite | Partial | Strong | Strong | Strong | Partial | Weak | Weak | Partial |
| Open-source core or project credibility | Strong | Weak | Weak | Weak | Weak | Strong | Strong | Weak |
| Self-managed or sovereign deployment choice | Strong | Weak | Partial | Weak | Weak | Strong | Strong | Strong |
| Hyperscaler bundle / procurement power | Medium | Medium | Medium | Strong | Strong | Weak | Weak | Weak |
| Embedded or application-native analytics fit | Medium | Weak | Weak | Weak | Weak | Strong | Medium | Strong |
Cells are ordinal judgments synthesized from reviewed official product and documentation surfaces; they indicate posture and fit, not audited benchmark superiority on every workload.
[CP001, CP002, CP011, CP015, CP019, CP021]Visual summary of where ClickHouse leads, where suites lead, and where narrower substitutes remain credible.
Heatmap labels are evidence-backed ordinal judgments summarizing product posture rather than head-to-head benchmark scores.
[CP001, CP011, CP015, CP019, CP021, CP025]3.3 Pricing, deployment models, and GTM strength
Pricing and deployment model are where ClickHouse most clearly separates from the field. Its public pricing narrative stresses independent storage and compute scaling, autoscaling, and scale-to-zero economics, while its documentation keeps self-managed and managed consumption on the same underlying engine. Snowflake is more explicit than ClickHouse about billing mechanics: warehouses consume credits, sizes are published, and the platform separates compute, storage, and data-transfer costs. Databricks is public but less benchmark-friendly, publishing list prices and SKU groups instead of one simple comparable list rate. BigQuery, Redshift, Athena, Imply, and SingleStore all expose clearer public starting points than ClickHouse, which helps buyers model pilots and intermittent workloads. Deployment model also matters for trust and regulation: Snowflake, BigQuery, and Athena are primarily managed-service choices; SingleStore still supports self-managed deployment; and DuckDB is local by design. That mix drives GTM outcomes. Hyperscaler-owned products can ride broad procurement leverage, while Databricks can rely on a much larger installed base and partner set. ClickHouse therefore wins most naturally where technical differentiation matters enough to overcome a smaller field motion.[CP003, CP004, CP005, CP012, CP013, CP016]
| Platform | Price / unit / contract model | Included capabilities | Discount / unknowns | Competitive implication |
|---|---|---|---|---|
| ClickHouse | Usage-based cloud pricing with separate compute and storage, autoscaling, and scale-to-zero behavior | Managed cloud for real-time analytics, warehousing, observability, and AI-serving use cases | Public pricing philosophy is clear, but realized enterprise discount bands are not disclosed | Strong for engineering buyers who value efficient bursty workloads, weaker for CFOs wanting a one-line public list rate |
| Snowflake | Compute credits plus storage and data-transfer charges; warehouses billed per second with a 60-second minimum | Managed multi-cloud analytics and AI data platform with governed warehouses | Net price varies by edition, cloud, and negotiated credit economics | Highly benchmarkable billing mechanics make Snowflake easy to model against ClickHouse |
| Databricks | Undiscounted list prices and SKU groups across cloud providers | Lakehouse platform spanning data engineering, governance, analytics, and AI | Public list price exists, but practical comparison depends on SKU mix and negotiated terms | Broad platform scope is strong, but list-price complexity obscures easy apples-to-apples comparison |
| BigQuery | Serverless compute plus storage pricing, with free tier and slot reservations | Warehouse and AI platform services inside Google Cloud | Actual cost depends on scan volume, slot commitments, and adjacent Google services | Very easy pilot path and strong serverless simplicity for GCP-centric teams |
| Redshift | Provisioned starts at $0.543/hour and serverless at $1.50/hour, with reservation discounts | AWS-native warehouse and lakehouse service | Total cost still depends on data, concurrency, and AWS estate context | Clear entry pricing and AWS procurement leverage make Redshift a practical incumbent alternative |
| Athena | Pay for data processed or compute used with no infrastructure management | Ad hoc SQL and Spark analytics directly on S3 and other sources | Efficient only when query patterns and data layout stay disciplined | Excellent for intermittent AWS-native analytics; less differentiated as a persistent high-concurrency serving layer |
| DuckDB | Free open-source software | Embedded local analytical engine | Enterprise support and hosted control-plane economics are not the point of the product | Very strong substitute for local analytics, but not a like-for-like cloud platform replacement |
| StarRocks | Public commercial pricing is limited in reviewed sources | Real-time, lakehouse, and AI analytics engine | Customers likely need direct engagement for pricing certainty | Lower transparency makes quick procurement harder despite attractive technical positioning |
| Imply Polaris | Starter from $100/month, Standard from $600/month, Custom by quote | Real-time analytics DBaaS on top of Druid lineage | High-scale or tailored environments still require enterprise discussion | Transparent for early-stage tests and smaller workloads |
| SingleStore | Usage-based credits plus storage charges and commitment pricing | Cloud DBaaS for real-time transactions plus analytics | TCO depends on workload shape and chosen edition | Clearer than ClickHouse or Databricks for initial modeling, but optimized for a somewhat different mixed-workload buyer |
This table compares public packaging and pricing mechanics rather than negotiated net price or full workload-specific TCO.
[CP005, CP012, CP013, CP016, CP020, CP022]3.4 Moat durability, switching cost, and competitive risk
ClickHouse’s moat is real but conditional. The strongest durable edge is the combination of open-source credibility, deployment sovereignty, and a product reputation built around speed-sensitive analytical workloads. That combination is unusual among scaled vendors: Snowflake is more managed and suite-driven, Databricks is broader and more workflow-centric, and the hyperscaler services are more procurement-driven than community-driven. But that same structure limits hard lock-in. Buyers can multi-home because the category is segmented by workload and because open or self-managed alternatives remain credible. Snowflake and Databricks threaten ClickHouse from above by widening into governed data and AI suites. BigQuery, Redshift, and Athena threaten it from the side through cloud-account control and bundled adjacencies. DuckDB, Druid, StarRocks, and SingleStore threaten specific slices from below or beside by offering embedded, streaming, lakehouse, or HTAP alternatives. That means the durable underwriting question is not whether ClickHouse is technically strong; it is whether that technical edge keeps translating into wins faster than broader bundles and narrower substitutes erode the premium.[CP006, CP007, CP008, CP034, CP035, CP042]
| Moat claim | Threat | Severity | Mitigation / diligence ask |
|---|---|---|---|
| Open-source credibility and developer adoption lower adoption friction | Open substitutes such as DuckDB, Druid, and StarRocks also speak to engineers and lower lock-in | Medium | Ask management for conversion rates from open-source users to paid cloud and enterprise support |
| Flexible deployment supports sovereignty and regulated buyer narratives | Hyperscaler services still win when procurement convenience outweighs sovereignty benefits | High | Request ARR mix by self-managed, sovereign, and managed cloud deployments |
| Speed-sensitive serving and observability are attractive ClickHouse wedges | Snowflake, Databricks, and Redshift all continue to market performance improvements and AI-adjacent analytical breadth | High | Review workload-level win-loss data for observability and user-facing analytics evaluations |
| Public pricing philosophy suggests efficient burst economics | Incumbents such as AWS, BigQuery, Athena, Imply, and SingleStore often publish simpler public starting prices | Medium | Request realized enterprise discount bands and pilot-to-production cost curves |
| Capital and customer momentum make ClickHouse credible in enterprise deals | Commercial reach still trails hyperscalers and likely trails Snowflake and Databricks field coverage | High | Test partner-sourced pipeline, channel leverage, and regional sales coverage versus direct rivals |
| Workload specialization avoids becoming a me-too suite | Broader governed data-and-AI suites can absorb more budget even when ClickHouse wins on speed | High | Ask whether ClickHouse is winning primary platform deals or mainly landing as a specialist workload engine |
The key question is whether ClickHouse can turn technical and community strengths into sustained share gains against broader bundles and narrower substitutes.
[CP035, CP039, CP042, CP043, CP044, CP045]Ordinal scorecard of the dimensions most likely to determine whether ClickHouse keeps winning against broader suites and procurement-led incumbents.
Scores are analyst-derived ordinal judgments based on reviewed public evidence; they are not reported company KPIs.
[CP005, CP034, CP035, CP043, CP044, CP046]3.5 Exhibits
04Financials
4.1 Monetization and pricing architecture
ClickHouse has a clearer commercial story than it has financial disclosure. Across its cloud, pricing, and use-case pages, the company steers users from free/open-source adoption into ClickHouse Cloud, a managed service positioned around pay-for-use economics, separate compute and storage scaling, and automatic down-scaling of idle resources. The 2022 launch sequence is financially important because it shows the company iterating toward a monetization model that developers could understand: the AWS beta preceded a December 6, 2022 GA release that extended the trial to 30 days, introduced lower-spend Development Services, and improved compute metering. TechCrunch and m3ter add a useful commercial detail: management deliberately pursued a product-led motion with transparent consumption billing and simplified beta pricing from read/write units to storage plus compute before GA. Public pages also show that monetization is broader than a single OLAP SKU. ClickHouse now markets the platform across real-time analytics, data warehousing, observability through ClickStack, AI-native workloads, and managed ingestion via ClickPipes. What remains opaque is the actual mix. No reviewed public source discloses how much revenue comes from self-serve cloud, dedicated enterprise deployments, BYOC, or newer observability and AI add-ons.[CI013, CI014, CI015, CI016, CI017, CI018]
| Stream | Mechanism | Unit / pricing logic | Current public status | Revenue quality view | Diligence ask |
|---|---|---|---|---|---|
| ClickHouse Cloud multi-tenant service | Managed service for analytics, AI, and observability workloads on shared cloud infrastructure. | Usage-based billing on compute plus storage. | Clearly active and positioned as the monetization core. | High confidence on existence; low confidence on actual mix by customer cohort. | Request cloud ARR split by self-serve versus enterprise customers. |
| Dedicated cloud / isolated deployments | Higher-control cloud environments for larger or more regulated accounts. | Custom enterprise contract, likely higher ACV than self-serve. | Publicly described but not publicly priced. | Medium confidence on strategic importance; low on actual contribution. | Request ACV bands and win rates for dedicated deployments. |
| Bring Your Own Cloud (BYOC) | ClickHouse manages the control plane inside customer environments. | Negotiated enterprise pricing, likely service plus support economics. | Publicly described by Sacra as a live deployment mode. | Medium confidence on monetization path; low confidence on volume. | Request number of BYOC customers and contracted ARR. |
| Observability / ClickStack on Cloud | Managed observability stack on ClickHouse Cloud for logs, traces, metrics, and replays. | Cloud usage plus retention/ingestion economics. | Publicly marketed and tied to cost-efficiency claims. | Medium confidence on product relevance; low confidence on current revenue share. | Request observability-specific customers, ARR, and retention. |
| Real-time analytics workloads | User-facing dashboards, fraud, marketing, and operational analytics on ClickHouse Cloud. | Consumption grows with ingest, concurrency, and storage. | Core marketed workload category. | High confidence it drives paid usage, but specific sector mix is private. | Request top verticals and gross margin by workload family. |
| Data warehousing and BI | Modern data warehouse positioning for BI and concurrency-heavy workloads. | Consumption pricing with enterprise expansion potential. | Core marketed workload category. | High confidence on demand surface; low confidence on warehouse-only revenue mix. | Request warehousing ARR, typical data footprint, and competitive win-loss detail. |
| Managed ingestion / ClickPipes and integrations | Managed connectors and ingestion tooling that simplify onboarding and expansion. | Likely monetized through higher cloud usage, premium features, or attach rates. | Publicly visible but not separately priced. | Medium confidence that it helps expansion; low confidence on direct monetization. | Request attach-rate and upsell data for ClickPipes and partner integrations. |
Partial enumeration of publicly visible revenue surfaces as of 2026-05-27; reviewed sources do not disclose actual revenue mix or realized pricing.
[CI013, CI020, CI021, CI022, CI030, CI031]| Surface | Public pricing signal | What is actually disclosed | Financial implication | Exact diligence ask |
|---|---|---|---|---|
| Open-source core | Free | The core database remains open source and free to use. | Creates adoption leverage but no separately disclosed license line. | Request paid-conversion funnel from community to cloud. |
| Cloud trial / freemium entry | 30-day trial and $300 credits | Official cloud pages still offer free trial credits to start usage. | Supports low-friction PLG acquisition, but trial-to-paid conversion is undisclosed. | Request trial conversion rates by segment and cohort. |
| Standard cloud consumption | Usage-based compute + storage | Official pages say compute and storage scale separately and customers pay only for what they use. | Good fit for bursty workloads; realized street pricing remains opaque. | Request realized price per compute unit and storage TB after discounts. |
| Development Services | Low-monthly-spend entry tier | 2022 GA notes introduced a starter-oriented Development Services offer. | Improves developer onboarding, but gross profit and upsell yield are undisclosed. | Request current package limits, unit economics, and upgrade conversion. |
| Dedicated / BYOC enterprise | Negotiated | Public sources say enterprise customers can choose dedicated clusters or BYOC. | Likely improves ACV and retention, but none of the reviewed sources disclose contract structure. | Request average enterprise ACV, term length, and margin delta versus self-serve. |
| Integrations / ClickPipes / observability add-ons | Not list-priced in reviewed sources | Public pages market integrations, observability, and managed ingestion, but do not break out pricing. | Could raise wallet share and retention without appearing as a separate revenue line. | Request attach rates, packaging rules, and cross-sell revenue contribution. |
Public evidence confirms usage-based and free-trial-led pricing mechanics, but not realized pricing, discounting, or blended customer economics.
[CI016, CI017, CI018, CI019, CI022, CI024]Public evidence points to a cloud-led model that converts free/open-source adoption into metered cloud revenue and broader workload expansion.
Qualitative bridge only. Public sources confirm the model components but do not disclose conversion rates, blended price realization, or cloud gross profit per stage.
[CI013, CI017, CI020, CI021, CI022, CI029]4.2 Growth signals and public estimate range
Growth evidence is unusually strong for a private infrastructure company, but it arrives in fragments rather than audited financial statements. Company-linked Series C materials said ClickHouse had grown more than 300% during the prior year and topped 2,000 customers by May 2025. Later 2025 extension materials said ARR had more than quadrupled over the preceding year, while TechCrunch reported in January 2026 that cloud ARR was still growing more than 250% year over year. The cleanest third-party numeric estimate comes from Sacra, which put 2025 annualized revenue at about $160 million, up 256% from a $45 million exit rate at the end of 2024. Taken together, those data points support a cautious public estimate band of roughly $150 million to $200 million for 2025 ARR or annualized revenue, which is consistent with the user's guidance and still avoids pretending that a private company has published audited revenue. The GTM implication is favorable but incomplete. ClickHouse appears to be compounding open-source adoption, free-trial conversion, and enterprise expansion across AI-heavy accounts, yet public sources still do not disclose CAC, payback, ACV segmentation, or net retention.[CI006, CI007, CI008, CI009, CI010, CI011]
| Metric | Public value / range | Confidence | Why it matters | Diligence ask |
|---|---|---|---|---|
| 2025 annualized revenue / ARR | $150M-$200M public estimate range (midpoint ~$160M) | Medium | Anchors valuation and scale discussion without pretending audited revenue exists. | Request monthly recurring revenue bridge and audited FY2025 revenue. |
| 2024 exit annualized revenue | $45M (Sacra estimate) | Medium | Provides the base for evaluating the 2025 growth claim. | Request monthly revenue history for 2024 and 2025. |
| Cloud ARR growth into 2026 | >250% YoY | Medium | Signals strong cloud expansion and supports the capital-raising narrative. | Request cloud ARR by quarter and by deployment mode. |
| Reported company growth before Series C | >300% during prior year | Medium | Important momentum claim, but company-linked rather than audited. | Request definition of growth metric and reconciliation to GAAP revenue. |
| Customer count | >2,000 customers by 2025 | Medium | Useful breadth signal, though customer count says little about concentration. | Request ARR concentration and ACV distribution by customer cohort. |
| Gross-margin proxy | Compute-storage separation and autoscaling may support attractive incremental margins | Medium | Explains why investors may underwrite the model despite limited public financial disclosure. | Request actual cloud gross margin, infra COGS, and support cost burden. |
| Sales-efficiency proxy | PLG free trial, 100+ early cloud customers, open-source community funnel | Low-Medium | Suggests efficient acquisition, but does not replace CAC or payback. | Request CAC, sales ramp, and payback by self-serve versus enterprise. |
| Net retention / churn | Not publicly disclosed | Low | Critical for judging revenue quality and durability. | Request NRR, GRR, logo churn, and expansion contribution by cohort. |
Mixes disclosed facts with explicit estimates and qualitative proxies; no reviewed public source provides audited ClickHouse unit-economics tables.
[CI010, CI011, CI012, CI023, CI024, CI025]Range figure comparing the public 2025 revenue estimate with disclosed capital pools and valuation anchors.
Revenue numbers are estimates, not audited results. Funding totals combine company-announced equity rounds and the publicly disclosed $100M credit facility.
[CI004, CI005, CI006, CI011, CI012, CI039]4.3 Capital adequacy and opaque cost structure
Capital access looks strong, but liquidity visibility is poor. ClickHouse started with roughly $50 million of Series A capital in August 2021, expanded that base with a $250 million Series B at a $2 billion valuation in October 2021, and then raised a $350 million Series C in May 2025 led by Khosla Ventures. The 2025 materials also disclosed a $100 million credit facility led by Stifel and Goldman Sachs, and January 2026 reporting tied a new $400 million round to a $15 billion valuation. Those are powerful signals that investors still believe managed cloud analytics can compound quickly. But the public record still omits the underwriting essentials: cash on hand, monthly burn, infrastructure COGS, gross margin, net retention, customer concentration, and any detail on whether the credit line is drawn or covenant constrained. Public company data-platform peers continue to file current annual reports and risk factors that let outsiders benchmark margin structure and competition. ClickHouse does not. As a result, the financing stack is visible, while the cash-flow bridge and downside protection remain private.[CI001, CI002, CI003, CI004, CI005, CI006]
| Item | Public value | Date | Why it matters now | Residual gap |
|---|---|---|---|---|
| Earlier funding base | Roughly $50M Series A | Aug-2021 | Shows the company entered Series B from a relatively modest capital base. | No public burn bridge from Series A to Series B. |
| Series B | $250M at $2B valuation | Oct-2021 | Established the company as a well-funded independent commercial entity. | No public use-of-proceeds tracking versus actual spend. |
| Series C | $350M led by Khosla Ventures | May-2025 | Major recapitalization behind AI-era scale claims and product expansion. | No public cash balance or runway disclosure after the raise. |
| Credit facility | $100M led by Stifel and Goldman Sachs | May-2025 | Adds non-equity liquidity capacity. | Drawn amount, covenants, interest, and maturity are not public. |
| Series C extension | Additional capital from Citi Ventures, Insight Partners, Peak XV and others | Oct-2025 | Signals continued financing access after the initial Series C. | Exact extension size is not disclosed in reviewed sources. |
| Latest valuation signal | $15B valuation on a reported $400M Jan-2026 round | Jan-2026 | Confirms strong investor appetite and balance-sheet optionality. | Still does not disclose current cash, burn, or dilution structure. |
| Current liquidity | Not publicly disclosed | As of 2026-05-27 | This is the core blocker to true runway underwriting. | Need cash, forecast burn, and covenant headroom from management. |
Table focuses on current capital adequacy signals rather than repeating a full company-history chronology; liquidity and debt detail remain private.
[CI001, CI002, CI003, CI004, CI005, CI006]| Missing metric | Why it matters | Current public state | Exact diligence path |
|---|---|---|---|
| Revenue mix by product / deployment mode | Needed to judge revenue quality and the share coming from durable enterprise cloud. | Not disclosed. | Request cloud ARR split by self-serve, dedicated, BYOC, observability, and support. |
| Realized pricing and discounting | Needed to distinguish list philosophy from actual monetization. | Not disclosed. | Request customer contracts, discount schedules, and renewal pricing cohorts. |
| Gross margin and infra COGS | Needed to test whether compute-storage separation truly yields attractive cloud unit economics. | Not disclosed. | Request gross margin bridge, hosting spend, support costs, and contribution margin by deployment type. |
| NRR, GRR, and churn | Needed to test durability of cloud expansion and multi-workload consolidation. | Not disclosed. | Request retention by cohort, downgrades, and expansion contribution. |
| Cash, burn, and runway | Needed to judge financing dependency and downside protection. | Not disclosed. | Request balance sheet, cash flow statement, monthly burn, and base / downside runway. |
| Debt terms and facility draw | Needed to test covenant risk and incremental leverage. | Facility exists but terms are not disclosed. | Request credit agreement, drawn balance, pricing grid, collateral, and covenant headroom. |
| Customer concentration / top account exposure | Needed to assess whether AI-led wins are diversified or concentrated. | Not disclosed. | Request top-10 customer ARR share, cohort gross margin, and renewal schedule. |
These are the main blockers preventing a public-only underwriting case even though the growth and fundraising signals are unusually strong.
[CI024, CI025, CI026, CI027, CI035, CI036]Cost structure is directionally favorable on paper, but the public record still leaves gross profit and infrastructure burden unverified.
Qualitative only. The sources support the economic mechanism but not the actual gross margin or contribution margin values.
[CI023, CI024, CI025, CI037, CI038]Funding signals are strong, but every path still ends in a liquidity-information gap.
Signal flow only. Public funding and valuation events are known, but they do not disclose the present cash bridge or debt usage.
[CI001, CI003, CI005, CI006, CI027, CI039]4.4 Underwriting view
The financial verdict is therefore strong on top-line momentum and weak on disclosure quality. ClickHouse has credible evidence of a cloud-led business model, a freemium/open-source funnel, multi-workload monetization, and unusually fast growth for a private data platform. The 2025 and 2026 financing trail also suggests that investors were willing to keep underwriting the story at materially higher valuations. But public data is still insufficient for hard underwriting on revenue quality. There is no disclosed revenue mix between self-serve and enterprise cloud, no disclosed gross margin or infrastructure cost burden, no disclosed net retention, and no disclosed liquidity bridge despite the mix of equity and credit. The right interpretation is not that ClickHouse lacks financial quality; it is that outsiders can currently see the growth case much more clearly than the downside case. Any investment memo should therefore treat the roughly $150 million to $200 million 2025 ARR or annualized revenue estimate and the margin-upside thesis as plausible, but still dependent on private data room evidence before conviction is warranted.[CI011, CI012, CI024, CI025, CI026, CI027]
4.5 Exhibits
05Product & Technology
5.1 Product Scope & Delivery Model
ClickHouse is not a single SKU so much as a product family anchored on one analytics engine. The public record supports three concrete delivery surfaces. First, there is the open-source column-oriented OLAP database itself. Second, there is ClickHouse Cloud, which is explicitly positioned as a fully managed service and available across the major cloud marketplaces/providers. Third, there are managed workflow surfaces around the engine—most visibly ClickPipes for ingestion plus cloud operations surfaces such as the SQL console and clickhousectl. That means the customer decision is less about whether ClickHouse is “database or platform” and more about which layer of the stack they want ClickHouse to operate for them. In customer workflow terms, the product is strongest where a team wants fast analytical storage plus operational convenience, not where it wants a complete semantic BI stack out of the box. ClickHouse Cloud’s pitch is operational relief: serverless operations, autoscaling, backups, replication, and provider choice. ClickPipes reduces the amount of custom ingestion plumbing a team has to maintain. Microsoft’s connector and Azure integration evidence show the workflow extending into BI and event pipelines, while the module map shows that the cloud control plane materially changes setup, scaling, and integration work compared with a raw self-managed deployment. The main diligence nuance is that many surrounding workflows still depend on third-party or partner tools rather than a single vertically integrated ClickHouse-native application layer.[CE001, CE002, CE003, CE004, CE005, CE006]
| Module / asset | Primary user | Status / maturity | Differentiation | Diligence gap |
|---|---|---|---|---|
| Open-source ClickHouse server | Data platform and infrastructure teams | Production-mature core product with broad OSS adoption | Vectorized columnar OLAP engine with MergeTree storage model | Need customer-specific proof on migration effort from incumbent warehouses or search stacks |
| ClickHouse Cloud managed service | Teams that want managed analytics infra | GA across major cloud marketplaces/providers with active 2026 region work | Managed operations, autoscaling, shared storage design, parallel replicas, compute separation | Need clearer public SLOs and workload-specific benchmark methodology |
| ClickPipes managed ingestion | Platform teams moving event or CDC data into ClickHouse | In market and expanding by region/connectors in 2026 | Cloud-native ingestion without custom ETL or consumer fleet management | Connector depth and backlog handling vary by source and should be tested on real schemas |
| SQL console and clickhousectl | Operators, analysts, and developers | In market | Native operating surface for querying and managing cloud services | Public feedback still asks for deeper query-plan and role-governance UX |
| Official client and connector ecosystem | Application developers and analytics engineers | High-signal maintenance footprint in 2026 | Python, JavaScript, Docker, Power BI/Fabric, dbt and ODBC paths already documented | Connector coverage is broad, but support ownership is split across core, partner, and community tiers |
This matrix distinguishes the core engine, managed cloud, ingestion plane, admin surfaces, and ecosystem packages; it is not a revenue decomposition.
[CE001, CE002, CE003, CE005, CE019, CE024]How teams move from source systems into managed analytics workflows on ClickHouse.
This workflow abstracts over several deployment choices. Not every customer uses every node, but each node is explicitly evidenced in the fetched materials.
[CE019, CE020, CE021, CE022, CE025, CE026]5.2 Engine, Storage & Cloud Architecture
The core technical differentiation remains the engine design. ClickHouse’s architecture material explicitly describes a vectorized execution model with optional code compilation, while the lower-level architecture page explains that data is processed as column chunks and most operations are dispatched over arrays rather than scalar values. That is consistent with what the product is built to do: large analytical scans, aggregates, filters, and joins over wide datasets rather than row-by-row transactional workloads. The storage layer is built around the MergeTree family, where inserts create immutable parts that are merged in the background. Those parts are sorted by primary key, and the index is sparse: it records marks for granules instead of every row, which keeps indexes small enough to stay memory-resident for very large datasets. That architecture matters because it explains both the performance upside and the operational tradeoffs. The sparse primary index and ordered storage improve pruning and compression, but the same docs note that sparse reads can still pull extra rows per block, so performance depends on schema design and key choice. In cloud deployments, ClickHouse adds another layer of differentiation: compute and storage separation, object-backed parallel replicas, Shared Catalog coordination, and compute-compute separation for read/write workload isolation. The Shared database engine documentation makes clear that SharedMergeTree-style stateless compute is not just a hosted VM wrapper; it is a different operating model designed for dynamic compute environments where local disks should not own durable state.[CE009, CE010, CE011, CE012, CE013, CE014]
| Layer / component | Role | Dependency | Risk |
|---|---|---|---|
| Query processing layer | Parses, plans, and runs analytical SQL with vectorized execution and optional code compilation | Core engine internals and column-chunk processing model | Performance advantage depends on workload fit and schema design, not on query speed claims alone |
| MergeTree storage parts | Stores immutable sorted parts and merges them in the background for high-ingest analytical tables | Primary key design, merge settings, background resources | Bad key design or too many tiny parts can erode ingest/query efficiency |
| Sparse primary index and granules | Keeps indexes memory-efficient while enabling data skipping | Sorting key choice and index_granularity | Sparse reads may still pull extra rows, so pruning is workload-sensitive |
| Shared catalog / Shared database engine | Coordinates stateless cloud compute and SharedMergeTree-style tables without local-disk ownership | Central catalog state and Keeper-backed coordination | Control-plane or metadata coordination is now a more important dependency than local-disk durability |
| Cloud compute pools and object-backed parallel replicas | Scale reads and writes with separated compute and replicated object-backed storage access | Cloud orchestration plus underlying object storage and provider primitives | Provider-region maturity and preview features can affect feature availability |
| Integration layer and table engines | Connects ClickHouse to brokers, databases, lake formats, and object stores | Connector quality plus external system APIs and schemas | Real customer experience depends on connector ownership and integration support tier |
This architecture table combines stable engine design with cloud operating-model details. It is an evidence-backed product architecture map, not a full source-code or control-plane diagram.
[CE009, CE010, CE011, CE012, CE013, CE014]Engine layers and cloud operating layers that together explain ClickHouse’s product differentiation for analytics workloads.
This figure is a product architecture map synthesized from documentation and product pages, not a complete source-code or infra topology diagram.
[CE009, CE010, CE011, CE012, CE013, CE014]5.3 Integrations, Ecosystem & Workflow Fit
ClickHouse’s workflow story is broad and pragmatic rather than purely native. The integrations index explicitly separates core, partner, and community integrations, which is a useful signal: ClickHouse supports a wide ecosystem, but not every connector is maintained to the same standard or with the same support model. For ingestion, the strongest public evidence is the combination of the Kafka engine and ClickPipes. The Kafka engine provides the low-level table-engine path with consumer-group, security, and materialized-view controls, while ClickPipes is the managed cloud path for Kafka, S3, Postgres, MongoDB, GCS, MySQL, and other sources. The Cloudflare Logpush guide is particularly valuable because it shows a concrete production pattern—S3 as durable buffer plus exactly-once and replay semantics—rather than generic integration marketing. The developer surface is also unusually strong for an infrastructure company. GitHub activity, package distribution, and container adoption all point to a mature open-source ecosystem: the main repo has large star/fork counts and frequent releases, the Python and JavaScript clients are actively maintained, and the Docker image has very high pull volume. Independent sources reinforce that the product is not limited to classic dashboarding; adopters and company stories span observability, cloud platforms, SEO, blockchain, and customer-facing analytics. The caveat is architectural rather than ecosystemal. HypeQuery’s analysis argues that once deployments scale, teams often build semantic, translation, or self-service layers above ClickHouse so analysts and business users do not need to reason directly about highly optimized schemas. In other words, ClickHouse fits the performance core of the workflow extremely well, but the last mile of governed self-service is frequently assembled from adjacent tooling.[CE019, CE020, CE021, CE022, CE023, CE024]
| User job | Current workflow | ClickHouse solution | Measurable benefit | Limitation |
|---|---|---|---|---|
| Ingest streaming events and logs | Build or operate custom consumers, landing zones, and retries | Kafka engine or ClickPipes-managed ingestion into ClickHouse Cloud | Managed ingestion reduces custom ETL and can add replay/exactly-once style semantics via buffered object storage paths | Operational behavior still depends on source system, schema drift, and connector maturity |
| Model warehouse transforms | Run SQL models and CI/CD outside the database | dbt-clickhouse adapter with incremental, MV, distributed and testing support | Lets analytics engineers standardize transforms and deployment workflows around ClickHouse | Adapter limitations still exist for some distributed and very large model patterns |
| Connect BI and semantic tools | Export or copy data into a separate BI store | Direct BI connectivity via ODBC/Power Query plus broader core/partner/community integrations | Supports DirectQuery/import and keeps analytics closer to operational data | Some cloud-service scenarios still require an ODBC driver and gateway bridge |
| Operate customer-facing or observability analytics | Scale dashboards and queries on large event streams | ClickHouse core plus cloud management, caching, and scale features | Independent and official evidence both point to sub-second or real-time analytical workflows at scale | Teams may still need custom abstraction layers for broad self-service |
| Democratize self-service analytics | Analysts ask platform teams for query help on optimized schemas | Use ClickHouse as the performance core under semantic or translation layers | Preserves engine performance while widening internal access to governed metrics | This abstraction layer is often ecosystem-built rather than natively owned by ClickHouse |
Benefits in this table are source-backed workflow outcomes, not audited ROI claims. The last-mile self-service layer often sits outside the native ClickHouse product.
[CE019, CE020, CE021, CE022, CE023, CE024]External systems and supporting layers that shape the full ClickHouse workflow beyond the core database engine.
The dependency map mixes native and ecosystem dependencies because buyers evaluating full workflow adoption care about both.
[CE003, CE010, CE024, CE027, CE046, CE047]5.4 Trust, Roadmap & Product Risks
On trust and operational maturity, the public surface is better than average for an infrastructure product but still incomplete for hard enterprise diligence. ClickHouse publicly documents a meaningful control set—SSO, MFA, RBAC, private connectivity options, IP filtering, CMEK, and a compliance list that includes GDPR, HIPAA, ISO 27001, PCI DSS, and SOC 2. The Azure GA announcement adds concrete platform-level claims around network isolation, traffic encryption, and multi-availability-zone replication. Those are useful signals that the managed service is designed for production-sensitive workloads, not only experimentation. The remaining risk is not absence of controls but uneven public depth. TrustRadius feedback still points to role-granularity and identity-provider limitations, while the broader workflow evidence suggests self-service deployments may need an extra semantic or translation layer that ClickHouse does not fully own. The 2026 changelog shows an active platform roadmap—autoscaling changes, spend controls, index sharding preview, AWS/GCP/Azure region work, and BYOC expansion—but preview features are not yet the same as mature, widely deployed defaults. For buyers, the implication is straightforward: ClickHouse appears technically strong for high-performance analytics workloads, especially when cloud-managed, but enterprise diligence should still press on benchmark methodology, identity governance depth, and the amount of surrounding platform code a team must build to democratize access safely.[CE028, CE029, CE030, CE031, CE032, CE033]
| Control / quality signal | Status | Scope | Implication | Gap |
|---|---|---|---|---|
| Compliance baseline | Publicly listed | GDPR, HIPAA, ISO 27001, PCI DSS, SOC 2 and related items | Signals an enterprise-oriented cloud posture rather than a hobbyist managed service | Public list is not the same as scoped audit evidence for a buyer’s required control set |
| Identity and access controls | Publicly listed | SSO, MFA, RBAC, IP filtering, CMEK, private connectivity | Shows ClickHouse understands buyer expectations for enterprise cloud controls | Public detail on SCIM, role granularity, and IdP compatibility is still thin |
| Network and availability controls | Publicly described for Azure GA | Network isolation, traffic encryption, multi-AZ replication | Supports production analytics workloads that need resilience and protected traffic paths | Need provider-by-provider and tier-by-tier detail on recovery objectives |
| Operational UX feedback | Independent mixed feedback | SQL console, roles, and SSO experience | Independent review evidence helps separate product-control claims from operator experience | Review evidence is anecdotal and should be validated with reference customers |
| Connector dependencies | Known requirement | ODBC driver and gateway for some Microsoft cloud flows | BI connectivity exists today and is not purely aspirational | Extra gateway/driver steps add implementation friction relative to truly browser-native SaaS connectors |
This table captures only controls and quality signals visible in fetched sources. It should not be read as a substitute for customer-facing security, privacy, or architecture review packs.
[CE028, CE029, CE030, CE042, CE050]| Date / stage | Feature / milestone | Status | Implication | Source |
|---|---|---|---|---|
| 2026-04 | Dual-window vertical autoscaling | Rolling / launched | Cloud service is tuning cost responsiveness for variable workloads rather than freezing around a single lookback window | 2026 cloud changelog |
| 2026-04 | Marketplace subscription sharing across AWS, Azure, and GCP | Launched | Confirms real multi-cloud commercial plumbing, not just marketing availability | 2026 cloud changelog |
| 2026-04 | Index sharding | Private preview | Shows active work on reducing memory pressure and improving performance for heavy index workloads such as vector and full-text search | 2026 cloud changelog |
| 2026-04 to 2026-05 | BYOC on GCP plus AWS/Azure/GCP region expansion | GA / launched | Strengthens platform reach for data residency and provider-choice requirements | 2026 cloud changelog |
| 2026-05 | Organization spend alerts and primary-service idling | GA | Signals maturing cloud-finops and workload-isolation capabilities | 2026 cloud changelog |
| 2026-05-26 | clickhouse-connect 1.1.0 package release | Released | Client ecosystem is still shipping meaningful updates during the run window | PyPI clickhouse-connect |
This is a public roadmap-and-release visibility table. It reflects what is publicly visible by the run date, not an internal product-management backlog.
[CE005, CE031, CE032, CE033, CE036]Public maturity view across core engine, managed cloud, ingestion, ecosystem, and buyer-facing trust signals.
These maturity labels are analytical judgments based on public evidence quality and feature rollout status as of the run date.
[CE005, CE031, CE032, CE033, CE042, CE046]5.5 Exhibits
06Customers
6.1 Customer Base and Segment Mix
ClickHouse's visible customer base is not broad in a generic SMB sense; it is deep in a handful of technical buyer groups that all need low-latency analytics over very large datasets. The dominant recurring segment is observability and telemetry: Cloudflare, OpenAI, Anthropic, Tesla, Qonto, Langfuse, and Lyft all use ClickHouse to store or query logs, traces, metrics, or internal analytical event streams where speed and high-cardinality exploration matter more than classical BI polish. A second major segment is customer-facing product and usage analytics, visible in Microsoft Clarity, Replo, Mintlify, Padlet, Ramp, Buildkite, and Polymarket, where ClickHouse sits behind dashboards, reporting, budgeting, experimentation, or leaderboard products that external customers touch directly. The common buyer is usually an engineering, platform, or data team; the end user is a broader set of analysts, operators, finance teams, teachers, developers, or SaaS customers who are intentionally insulated from raw ClickHouse complexity. That segmentation is a real strength because it clusters around workloads where ClickHouse's performance advantage is tangible, but it also means customer concentration risk should be assessed by workload family as much as by logo count: public proof is much denser in observability and real-time analytics than in classical enterprise warehouse use cases. The accessible public record also splits cleanly by evidence quality. Recent case-study customers provide high-confidence production proof with named speakers and quantified outcomes, while legacy marquee names like eBay, Spotify, Uber, and ByteDance appear mainly in adopter lists or slide references and should be treated as weaker proof unless diligence surfaces fresher deployment detail. [CU001, CU002, CU003, CU004, CU035, CU036]
| Segment | Buyer / User / Payer | Primary Use Case | Representative Scale Signal | Revenue / Strategic Value | Key Gap |
|---|---|---|---|---|---|
| Observability and telemetry platforms | Buyer = platform/SRE/data engineering; users = internal engineers and operators; payer = infra/platform budget owner | Logs, traces, metrics, incident analytics, billing analytics | Cloudflare quadrillion-row analytics; OpenAI petabyte-per-day logs; Tesla quadrillion-row Comet | Anchor workload where ClickHouse's performance edge is clearest and most defensible | No public disclosure of ARR mix or renewal rates by observability cohort |
| Product and customer analytics SaaS | Buyer = product/data engineering; users = product teams and end customers; payer = product or analytics budget | Embedded dashboards, behavior analytics, experimentation, attribution, reporting | Microsoft Clarity millions of projects; Replo 100B+ events; Padlet 40M monthly users; Buildkite 12B tests/month | Enables customer-facing analytics SKUs and higher-ARPU enterprise reporting surfaces | Public evidence over-indexes to successful migrations and may omit failed pilots |
| AI and LLM-native software | Buyer = platform/ML infrastructure; users = model, infra, and observability teams; payer = core platform budget | LLM observability, agent tracing, secure telemetry, model operations | Anthropic Claude 4 observability; Langfuse billion-scale agent traces; Mintlify agent-vs-human doc analytics | Fastest-growing proof cluster and strong 2025-2026 freshness | Unclear whether usage remains departmental or expands into large platform contracts |
| Fintech and marketplace operations | Buyer = engineering/data teams; users = finance ops, marketplace ops, risk teams; payer = product/platform budget | Spend analytics, forecasting, budgets, fraud/risk, leaderboard and market stats | Ramp 50,000+ customers; Qonto 600,000+ SMBs; Polymarket 100s rps leaderboard API | Demonstrates monetizable analytics embedded inside operational products | No public contract length, upsell rates, or multi-product penetration data |
| Developer tools and CI/CD analytics | Buyer = engineering platform; users = developers, release managers, QA teams; payer = engineering productivity budget | Test analytics, feature analytics, release observability | Buildkite 70B records and 25k eps peaks show strong developer-tool analytics fit | Good fit for high-cardinality, self-serve internal analytics with land-and-expand potential | Spending may be sensitive to developer-tool consolidation cycles |
| Industrial, education, and enterprise operations | Buyer = operations/data leaders; users = business operators, teachers, analysts; payer = line-of-business owner | Factory intelligence, classroom engagement analytics, operational BI | Contentsquare 13-month retention after migration; Padlet classroom metrics in 242 of 246 countries | Shows ClickHouse can reach beyond pure devtools into embedded operational analytics | Public stories do not reveal account sizes or renewal depth |
Segments are grouped from public case studies and adopter references rather than a disclosed customer roster. Null-like gaps reflect missing public commercial detail, not a claim that the segment is immaterial.
[CU001, CU002, CU003, CU004, CU021, CU022]| Metric | Value | Date / Period | Source | Confidence | Implication | Missing Denominator |
|---|---|---|---|---|---|---|
| Cloudflare query scale | 96T events/hour; 1.61 quadrillion/day in <2 seconds | 2025 meetup / 2026 story | ClickHouse Cloudflare story | high | Strongest public flagship proof of real production scale | No Cloudflare spend, cluster cost, or contract value disclosed |
| Cloudflare deployment tenure | In production since late 2016; 1,000+ active replicas; hundreds of millions inserted rows/sec | 2016-2023 deployment history | ClickHouse meetup report | high | Suggests unusually durable long-lived production use | Does not reveal commercial relationship with ClickHouse Inc. |
| OpenAI ingest growth | Petabytes/day, >20% monthly growth, 90 shards x2 replicas | 2025 | ClickHouse OpenAI case study | high | Confirms hyperscale observability fit under viral demand | No spend, retention, or SaaS contract terms disclosed |
| OpenAI surge resilience | 50% overnight log-volume spike; 40% CPU reduction after index fix | March 2025 | ClickHouse OpenAI case study | high | Shows operational elasticity plus upstream collaboration value | One account-specific incident, not portfolio-wide performance data |
| Tesla load test | 1B rows/sec for 11 days; >1 quadrillion rows ingested | 2025 | ClickHouse Tesla case study | high | Confirms extreme telemetry and PromQL-compatibility use case | Internal deployment economics not disclosed |
| Microsoft Clarity footprint | Millions of projects; hundreds of trillions of events; hundreds of PB | 2020-2026 current architecture description | Microsoft Clarity engineering blog | high | Strong embedded analytics proof from a large customer-owned surface | Public post does not identify ClickHouse commercial model |
| Buildkite usage growth | 3B → 12B test executions/month in six months; 70B records YTD; 25k eps peaks | 2025 | ClickHouse Buildkite case study | high | Clear adoption expansion after initial deployment | Customer count using Test Engine analytics not disclosed |
| Padlet real-time pipeline | 8B events/month; 14 rps; 45 ms median; 690 ms p99 | 2025-2026 | ClickHouse Padlet case study | high | Shows embedded classroom analytics at mass-market scale | No monetization or contract-size detail |
| Qonto observability compression | 231 TB uncompressed attributes stored in 376 GB (99.84% compression) | 2026 | ClickHouse Qonto case study | high | Strong proof of cost/scale advantage in financial-services operations | Savings not directly translated to ClickHouse contract expansion |
| Lyft analytics throughput | >450 TB read/day; ~4 TB write/day; hundreds of qps average, peaks in the thousands | 2025-2026 | ClickHouse Lyft case study | high | Demonstrates large-enterprise internal adoption breadth | Internal-only analytics does not automatically equal large external revenue |
The table mixes current and historical proof points because ClickHouse does not publish a standardized customer KPI set. Missing denominators mainly reflect absent contract, seat, or ARR context rather than weak technical scale.
[CU005, CU007, CU009, CU010, CU012, CU013]Typical buyer path from problem discovery to production deployment and internal expansion across the strongest public customer stories.
[CU003, CU021, CU022, CU023, CU024, CU025]6.2 Named Production Users and Deployment Patterns
The strongest production proof starts with Cloudflare, which has used ClickHouse in production since 2016 and now relies on it across HTTP and DNS analytics, log analytics, Workers runtime analysis, internal analytics, customer dashboards, Firewall Analytics, Radar, and usage billing. Public disclosures span old Cloudflare-authored engineering blogs, ClickHouse-hosted meetup reports, and a fresh 2026 ClickHouse customer story, giving unusual continuity across almost a decade of deployment history. Beyond Cloudflare, ClickHouse's best current references are mostly 2025-2026 customer stories on its own site: OpenAI discusses petabyte-scale observability with 90 shards and surge-driven optimization work; Anthropic describes a custom air-gapped, Cloud-like architecture used to ship Claude 4; Tesla details a quadrillion-row Comet platform for metrics; Microsoft Clarity explains how ClickHouse turned 30-minute heatmap generation into an instantaneous task at hundreds-of- trillions-of-events scale; and Contentsquare shows the migration pattern from Elasticsearch to ClickHouse for multi-tenant SaaS analytics, with 11x lower infrastructure cost and 10x p99 query improvement. The same pattern repeats in newer cloud-native SaaS stories such as Replo, Mintlify, Padlet, Buildkite, Ramp, Qonto, Lyft, and Polymarket: ClickHouse is usually adopted first to remove latency, cardinality, or cost bottlenecks in one product surface, then expanded into adjacent use cases once teams trust it in production. That creates a credible land-and-expand story inside accounts, but it also means many references are vendor-authored case studies rather than customer-authored engineering writeups, so outcome claims should be weighted by the independence and freshness of the supporting source. [CU005, CU006, CU007, CU008, CU009, CU010]
| Customer | Segment | Deployment / Use Case | Production vs Pilot | Outcome | Limitation |
|---|---|---|---|---|---|
| Cloudflare | Cloud infrastructure / observability | HTTP & DNS analytics, log analytics, Workers runtime analytics, Radar, billing analytics | Production, long-running (since 2016) | 96T events/hour and 1.61 quadrillion/day in <2s; 1,000+ active replicas; millions of calls/day for billing jobs | Best-in-class evidence, but commercial terms with ClickHouse are undisclosed |
| Contentsquare | Digital experience analytics SaaS | Main SaaS analytics migration from Elasticsearch to ClickHouse | Production, full migration completed customer-by-customer | 11x lower infrastructure cost, 10x p99 improvement, 13-month retention, zero regression during migration | Guest post is strong, but still published on ClickHouse's own blog |
| OpenAI | AI / LLM observability | Petabyte-scale observability for research, ChatGPT, and enterprise APIs | Production | Petabytes/day of logs, 90 shards x2 replicas, 50% surge handled with architecture changes and 40% CPU optimization | Strong technical proof, but no contract or renewal detail |
| Anthropic | AI / LLM observability | Secure, air-gapped observability stack used to support Claude 4 development and monitoring | Production | Custom Cloud-like deployment inside Anthropic secure environment; named operator says ClickHouse was instrumental to shipping Claude 4 | Outcome is strategic but less numerically explicit than Cloudflare or Tesla |
| Tesla | Industrial / fleet observability | PromQL-compatible Comet platform for massive metrics analytics | Production | Tens of millions of rows/sec live ingest; 1B rows/sec test for 11 days; >1 quadrillion rows ingested | Case study is vendor-authored and does not reveal spend or seat expansion |
| Microsoft Clarity | Product analytics | Free analytics, heatmaps, dashboards, and visual reporting | Production | Millions of projects; hundreds of trillions of events; heatmaps went from ~30 minutes to instantaneous | Customer-owned engineering proof is strong, but no ClickHouse commercial packaging detail |
| Replo | Merchant analytics SaaS | In-product analytics for Shopify merchants tracking sessions, purchases, A/B tests, and AOV | Production | 4,000+ merchants, 100B+ events, 3,000-5,000 events/sec, dashboards kept responsive with ~1 minute lag | Strong recent proof, but note this is Replo, not the requested Reprise logo |
| Buildkite / Ramp / Qonto / Lyft / Polymarket cohort | Developer tools / fintech / mobility operations | Real-time analytics, budgets, CI dashboards, observability, forecasting, and leaderboard APIs | Production | Each discloses concrete usage or savings metrics, showing repeatable land-and-expand beyond one flagship logo | Most evidence remains vendor-authored case studies rather than customer-authored blogs |
This table prioritizes named deployments with usable production detail. eBay, Spotify, Uber, and ByteDance remain public adopter references, but not high-detail rows here; Discord and Reprise did not surface in fetched corroborating sources and are tracked as evidence gaps instead of being treated as verified deployments.
[CU006, CU007, CU010, CU011, CU013, CU014]| Customer / Logo | Main Public Proof | Freshest Fetched Evidence | Evidence Quality | Quantified Outcome Visibility | Diligence Implication |
|---|---|---|---|---|---|
| Cloudflare | ClickHouse case study plus Cloudflare engineering blogs | May 2026 | High | High | Treat as anchor reference customer |
| OpenAI | ClickHouse customer story with named engineering speakers | 2025-2026 current | High | High | Strong for scale proof; still request contract and expansion detail |
| Anthropic | ClickHouse customer story with named technical owner | 2025 | High | Medium | Strong strategic proof, moderate numeric detail |
| Tesla | ClickHouse customer story with named senior engineer and stress-test metrics | 2025 | High | High | Strong for extreme-scale observability reference selling |
| Microsoft Clarity | Customer-owned engineering blog | 2026 | High | High | Valuable because proof is not hosted by ClickHouse |
| Contentsquare | Guest migration blog plus external architecture roundup | 2022-2026 | Medium-high | High | Strong migration/cost proof but still partly vendor-channel hosted |
| Uber | External architecture summary plus older slide reference in adopter list | 2020-2026 | Medium | Medium | Need direct current-customer reference before treating as flagship proof |
| Spotify / eBay | Official adopter-list entry and older slide / site references | 2018-2020 legacy | Low-medium | Low | Good logo signal, weak current production detail |
| ByteDance | External adopter summary / community list level proof | 2026 external list | Low | Low | Do not underwrite with this logo without primary evidence |
Evidence quality is based on source independence, freshness, named operator detail, and quantified outcome specificity. Discord and Reprise did not surface in fetched corroboration and therefore remain outside the scored rows.
[CU004, CU011, CU013, CU014, CU020, CU021]Common deployment path visible across ClickHouse customer stories, showing how the product typically progresses from evaluation to broader organizational adoption.
[CU011, CU012, CU021, CU022, CU023, CU024]Public customer evidence sorted by independence, quantified outcome specificity, freshness, and production maturity.
[CU004, CU005, CU006, CU011, CU013, CU014]6.3 Retention Proxies, Procurement, and Expansion Motion
ClickHouse does not publicly disclose customer NRR, GRR, logo retention, renewal rates, or top-account expansion cohorts, so durability has to be inferred from weaker proxies. The best available public proxies are product reviews and the shape of customer stories. On the positive side, PeerSpot users rate ClickHouse 8.6/10 and repeatedly cite performance, compression, scalability, and the lack of hard vendor lock-in because the self-hosted open-source version remains available. TrustRadius reviewers similarly describe ClickHouse as a primary real-time warehouse and praise MergeTree performance and data skipping. On the negative side, the same review set raises recurring complaints around cloud RBAC granularity, SSO gaps, documentation, UI maturity, setup complexity, and cloud-cost management. Those are not fatal, but they imply retention depends on technically capable customers who can absorb product rough edges. Public customer stories also show a consistent procurement and expansion path. Teams often start with a narrow high-value workload, prove latency or cost gains, then expand into adjacent product surfaces. Open-source trialability lowers evaluation friction, while ClickHouse Cloud's separate storage and compute, autoscaling, scale-to-zero, and operational offload support expansion once workloads reach multi-team or enterprise importance. That procurement shape is favorable, but without renewal and expansion-rate data it remains a thesis, not a verified customer durability metric. [CU015, CU016, CU022, CU023, CU024, CU025]
| Metric | Value / Status | Segment | Confidence | Diligence Ask |
|---|---|---|---|---|
| Net revenue retention (NRR) | Not publicly disclosed | Portfolio-wide | null | Request NRR by cloud, self-managed support, and enterprise support cohorts |
| Gross revenue retention (GRR) | Not publicly disclosed | Portfolio-wide | null | Request GRR and gross logo retention by top customer cohort and workload family |
| Renewal / contract length | Not publicly disclosed in fetched public materials | Enterprise customers | low | Request median contract term, renewal rate, and early expansion timing |
| PeerSpot product rating | 8.6 / 10 | Independent reviewers / evaluators | medium | Validate review count trend and enterprise-vs-open-source mix behind the score |
| TrustRadius qualitative sentiment | Positive on performance and data skipping; negative on SQL console, cloud RBAC, and SSO gaps | Practitioners / data engineers | medium | Compare review themes with support ticket volume and churn reasons |
| Vendor lock-in proxy | Open source repeatedly cited as reducing lock-in and easing trial procurement | Technical buyers | medium | Measure what percent of cloud wins started from self-managed or OSS usage |
| Support / operations proxy | Cloud users cite operational offload; reviewers still note documentation and setup complexity | Cloud and self-hosted users | medium | Request support SLA attainment, time-to-value, and implementation success rate |
| Product rough-edge proxy | Complaints center on UI/security/admin maturity and cost monitoring rather than core performance | Enterprise evaluators | medium | Request top enterprise objections, lost-deal reasons, and post-sale escalation categories |
These are public retention proxies, not true commercial retention metrics. Review-site sentiment and operational feedback indicate buyer satisfaction and friction but cannot substitute for renewal, contraction, or expansion data.
[CU031, CU032, CU033, CU034, CU038]6.4 Concentration Risk and Evidence-Quality Assessment
The public record supports two opposite conclusions at once. First, ClickHouse does not look dangerously concentrated to any single named customer or industry in the superficial sense: fetched case studies span cloud infrastructure, frontier AI, developer tools, fintech, education, digital banking, consumer internet, and industrial analytics. That breadth reduces obvious single-vertical concentration. Second, the proof base is clearly concentrated by workload family. Observability, telemetry, customer-facing analytics, and high-cardinality operational reporting are vastly overrepresented relative to other data-platform use cases. If that buyer set softens, if hyperscaler-native alternatives narrow the performance gap, or if new entrants commoditize observability backends, public reference density could become a false sense of diversification. Evidence quality also varies materially across names. Cloudflare is the gold-standard reference because the deployment is corroborated by both ClickHouse and Cloudflare engineering surfaces over many years. Contentsquare is also strong because it combines a guest technical migration post with an independent architecture roundup. OpenAI, Anthropic, Tesla, Microsoft Clarity, Mintlify, Padlet, Buildkite, Ramp, Qonto, Lyft, and Polymarket are strong current vendor-authored case studies with named operators and metrics, but still primarily originate from ClickHouse-controlled publishing channels. By contrast, eBay, Spotify, Uber, and ByteDance are best treated as medium- to low-confidence logo proof until diligence verifies current scope, contract value, and deployment status. Discord and Reprise did not surface in the fetched official or public-corroboration set, so they should not be counted as production proof without direct customer or contract evidence. The largest remaining blind spots are top-customer ARR share, renewal economics, and whether the most visible public logos translate into durable commercial concentration or simply marquee marketing references. [CU004, CU014, CU020, CU035, CU036, CU037]
| Expansion Driver / Risk | Type | Impact | Diligence Path |
|---|---|---|---|
| Open-source-to-cloud conversion | Expansion driver | Lowers initial procurement friction and creates later monetization path into managed cloud or enterprise support | Request funnel from OSS evaluator to paid cloud conversion by segment and vintage |
| Multi-workload expansion inside accounts | Expansion driver | Customers often start with one analytics workload and then add adjacent observability, billing, release, or AI use cases | Request account-level module expansion timelines and net expansion by first workload |
| Managed-service operational offload | Expansion driver | ClickHouse Cloud reduces patching, sharding, and capacity-planning burden for small platform teams | Request win-rate delta for cloud vs self-managed in enterprise deals |
| Workload-family concentration | Concentration risk | Public proof clusters heavily around observability and real-time customer analytics, implying GTM concentration by use case | Request ARR mix by observability, product analytics, warehousing, and AI/LLM cohorts |
| Marquee-logo evidence fragility | Concentration risk | eBay, Spotify, Uber, and ByteDance remain mostly list-level proof, so perceived logo depth may exceed verified current usage depth | Obtain direct customer references, current spend ranges, and recent usage attestations for marquee names |
| Top-customer revenue opacity | Concentration risk | No fetched public source discloses top-10 customer revenue share, largest-customer ARR, or renewal concentration | Request top-10 ARR concentration, largest logo share, and concentration trend over the last eight quarters |
| Discord / Reprise proof gap | Concentration risk | Requested names were not corroborated in fetched official/public sources, creating diligence noise around logo accuracy | Ask management for direct reference calls or current contract evidence before counting these logos in the bull case |
Expansion drivers are evidenced by repeated migration and second-workload case studies, while concentration risks reflect missing commercial disclosure and uneven evidence depth across logo cohorts.
[CU030, CU031, CU035, CU036, CU037, CU038]6.5 Exhibits
07Risks
7.1 Severity ranking and underwriting frame
ClickHouse’s risk profile is unusual because the company simultaneously enjoys obvious product momentum and a still-fragile underwriting record. The strongest public positives are real: the company raised $350 million at a $6.35 billion valuation in May 2025, said it had grown more than 300% year over year, and disclosed more than 2,000 customers spanning AI, observability, real-time analytics, and warehousing workloads. But the valuation case is still carried more by usage momentum and category positioning than by disclosed economics. The financing materials do not publish ARR, revenue, gross margin, or profitability, so the path from open- source adoption and free-trial conversion to durable public-market-quality earnings remains opaque. The most important risks therefore cluster around five themes: open-source commoditization and governance tension; competition from Snowflake and Databricks for large cloud data budgets; emerging substitution pressure from StarRocks on join-heavy workloads and DuckDB on local developer workflows; cloud execution risk in uptime, security, and patching; and customer-quality risk because public materials describe scale without showing concentration, retention, or cohort economics. None of these risks is fatal on its own, but together they set a high bar for post-funding execution.[CR001, CR002, CR003, CR034, CR035, CR037]
| Signal | Evidence | Why adverse | What offsets it | What to monitor next |
|---|---|---|---|---|
| Valuation without public economics | The 2025 financing disclosed customer growth but not ARR, margins, or profitability. | Execution risk is being capitalized before public unit economics are visible. | The company clearly has product momentum and blue-chip customers. | Revenue-quality disclosure and cohort economics |
| Open-core anxiety from ecosystem insiders | Altinity says meaningful features are becoming cloud-only and governance needs clearer separation. | Community trust can erode even while product demand remains strong. | Apache 2.0 core, large community, and many deployment modes still reduce immediate lock-in. | Roadmap transparency, contributor sentiment, and any foundation-style governance moves |
| Independent benchmark caveats on joins and concurrency | Exasol reported weaker distributed completeness and 1.39x runtime degradation at higher concurrency. | Buyers with the wrong workload mix may not realize the headline benchmark story. | ClickHouse still dominates many aggregation-centric and cost/performance narratives. | Reference customers for normalized, join-heavy, and high-concurrency use cases |
| Enterprise-plan gating of compliance | HIPAA and PCI are enterprise features and SLAs cover select committed-spend contracts. | Monetization may depend on a narrower set of premium accounts than raw user growth implies. | Upsell potential is real if those buyers expand. | Plan-tier ARR mix and enterprise gross retention |
| Visible cloud uptime is good but not elite | Public status page showed 98.62% aggregate uptime across Feb-May 2026. | The cloud product is still carrying service-quality execution risk in public. | AWS-specific uptime was reported at 100% and uptime is transparently disclosed. | Service-tier incident distribution and customer-level SLO attainment |
These are not claims of failure; they are the most supportable public signals that could widen into thesis-breakers if management execution slips.
[CR009, CR020, CR025, CR034, CR038, CR041]Residual risk is concentrated in monetization quality, competition, and OSS governance rather than in a single unresolved legal event.
[CR015, CR020, CR023, CR028, CR035, CR037]7.2 Competition, substitution, and open-source monetization risk
The clearest structural risk is that ClickHouse has to win two battles at once. Against Snowflake and Databricks, it must prove that an open-source-rooted engine can capture premium cloud analytics budgets rather than just serve as a performance adjunct or migration target for specific workloads. Independent rankings still show a wide mindshare gap: Snowflake and Databricks sit near the top of the cloud data stack while ClickHouse ranks lower despite rising momentum. At the same time, the lower end of the market is fragmenting. DuckDB is a credible substitute for local, embedded, and developer-first analytics where users can postpone or avoid a hosted service altogether. StarRocks is the sharper direct threat in operational analytics because published benchmark material keeps framing ClickHouse as weaker on complex multi-table and distributed-join scenarios. The monetization overlay matters even more because Altinity’s critique shows how ClickHouse Cloud-only features can improve paid-product differentiation while also deepening community distrust, increasing fork maintenance burden, and making the OSS roadmap feel subordinate to cloud packaging. That is a viable strategy, but it raises execution risk precisely because ClickHouse’s developer community is large enough to notice and react.[CR020, CR021, CR023, CR024, CR027, CR028]
| Threat | Why it matters | Public evidence | Likelihood | Severity | Current offset |
|---|---|---|---|---|---|
| Snowflake and Databricks budget gravity | They remain the reference cloud data platforms for large enterprise budgets and hold much stronger independent mindshare. | DB-Engines ranked Snowflake #6 and Databricks #7 versus ClickHouse #26 in May 2026; TechCrunch still labels ClickHouse a challenger. | high | high | ClickHouse has a strong performance narrative and migration stories for targeted workloads |
| StarRocks join-heavy momentum | Benchmark discourse keeps positioning StarRocks as stronger on wide-table and multi-table scenarios that matter in operational analytics. | StarRocks-sponsored Habr benchmark claimed 2.2x better SSB performance and said ClickHouse could not complete its TPC-H set. | medium-high | high | ClickHouse markets improving JOIN support and still wins many aggregation-centric workloads |
| DuckDB local / embedded substitution | Developers can solve many exploratory and embedded analytics problems without adopting a hosted cloud service. | Exasol benchmark says DuckDB remains attractive where operational simplicity matters; DB-Engines shows DuckDB still gaining mindshare. | medium | medium-high | ClickHouse counters with local CLI, clickhouse-local, and chDB to keep users in its ecosystem |
| OSS good-enough competition | If self-managed ClickHouse, forks, or managed alternatives solve enough use cases, cloud monetization can lag broad adoption. | Altinity warns cloud-only features and open-core drift change community incentives; Tinybird markets managed ClickHouse and alternative developer experience. | medium | high | Large community and official cloud tooling still give ClickHouse a strong upgrade path |
| Performance narrative reverses on customer workloads | Independent benchmarks can be cited against ClickHouse if concurrency or distributed-join behavior disappoints under real buyer workloads. | Exasol reported 1.39x concurrency degradation and weaker distributed completeness on TPC-H style tests. | medium | medium-high | ClickHouse still owns a powerful cost/performance narrative on its benchmark hub and product pages |
Benchmark evidence mixes independent and competitor-authored material; treat directionality as more reliable than precise multiples.
[CR023, CR024, CR025, CR026, CR027, CR028]Competition, governance tension, and reliability issues all transmit into the same valuation-support variables: growth quality, retention, and cloud margins.
Edges are directional and qualitative rather than weighted; the map is intended to show transmission channels into valuation support.
[CR020, CR023, CR025, CR027, CR036, CR037]7.3 Cloud execution, security, and legal-compliance risk
ClickHouse has done meaningful work on compliance and cloud operations, but the public record still supports a serious execution-risk register. On the positive side, the company documents SOC 2 Type II, ISO 27001, U.S. DPF, HIPAA, PCI, GDPR, and CCPA workstreams, which should matter to regulated buyers. Yet the same materials show that not every protection is universal: some controls and certifications are enterprise-plan features, SLAs are limited to select committed-spend contracts, and uptime remains a live metric instead of a quiet assumption. The public status page reported only 98.62% aggregate uptime across February through May 2026, which is directionally fine for a fast-growing infrastructure company but not strong enough to erase cloud- reliability diligence. Security posture creates a second tension. ClickHouse Cloud was not vulnerable to the flagship 2025 RCE, which is a positive operational signal, but the OSS record still includes RCE, query-cache, and crash-class vulnerabilities that require customers and fork maintainers to stay current on patches. For a company selling into enterprise observability, data warehousing, and AI applications, reliability and patch discipline are part of product value, not back-office hygiene.[CR008, CR009, CR010, CR011, CR012, CR013]
| Risk | Jurisdiction or surface | Status / evidence | Likelihood | Severity | Mitigation / current posture | Residual exposure |
|---|---|---|---|---|---|---|
| Privacy and cross-border transfer compliance drift | EU / UK / US privacy regimes | ClickHouse cites U.S. DPF plus internal GDPR and CCPA programs; privacy obligations continue to evolve. | medium | high | SOC 2, ISO 27001, DPF, privacy policy, and enterprise compliance process are in place. | high — regulated buyers still need evidence that these controls are operating effectively by region and tier |
| Enterprise compliance concentration | HIPAA / PCI customers | HIPAA and PCI are Enterprise-plan features, implying a premium regulated cohort. | medium | medium-high | Upsell path and compliance controls exist for high-value buyers. | medium-high — losing or slowing regulated accounts would disproportionately hurt monetization quality |
| OSS vulnerability trust shock | Self-managed ClickHouse deployments | 2025 RCE and earlier crash / ACL issues remain on the public security changelog. | medium | high | Cloud was unaffected by CVE-2025-1385 and the project publishes fixes and advisories. | medium-high — self-managed incidents can still damage brand trust across the full platform |
| Query-cache authorization defect | Role-based access and row policies | GitHub advisory documents RBAC bypass when query cache is enabled and roles are switched under one user. | low-medium | high | Workaround is to avoid query cache in multi-role patterns or separate users by role. | medium — the issue is patched, but it demonstrates non-trivial security design complexity |
| Open-core governance backlash | OSS roadmap / contributor governance | Altinity says important capabilities are cloud-only and asks for clearer OSS governance. | medium | medium-high | Apache 2.0 core, large community, and multiple deployment modes still reduce immediate lock-in. | medium-high — roadmap tension can slow community goodwill and increase fork rhetoric |
| Fork maintenance burden after security events | Downstream forks and private variants | The CVE-2025-1385 advisory tells maintainers of forks to port fixes themselves. | medium | medium | Upstream publishes patches across supported releases. | medium — serious forks inherit patching burden that can fragment ecosystem trust |
Coverage is partial and limited to risks visible in public compliance, advisory, and governance materials as of 2026-05-27.
[CR010, CR011, CR012, CR013, CR014, CR015]| Risk | Mechanism | Evidence | Likelihood | Severity | Mitigation maturity | Residual exposure |
|---|---|---|---|---|---|---|
| Cloud uptime misses or noisy incident history | Availability slips undermine trust for observability, AI, and warehouse workloads that expect interactive performance. | Public status page showed 98.62% aggregate uptime for Feb-May 2026. | medium | high | moderate — status page and SLA process exist | medium-high |
| Security patching becomes part of product delivery | Repeated OSS CVEs force disciplined upgrades and can drag enterprise trust if fixes lag customer fleets. | Security changelog lists RCE, cache, and crash-class issues across recent releases. | medium | high | moderate — advisories and fixes are public | medium-high |
| Cloud-only features become support and migration burden | Managed conveniences may speed monetization but also create more operational surface that self-managed users expect to replicate. | Cloud page highlights ClickPipes, orchestration, backups, scaling, and patching as differentiators. | medium | medium-high | moderate | medium-high |
| Distributed-join reliability remains workload-sensitive | If buyers bring more normalized or join-heavy workloads, benchmark weaknesses can surface as cloud cost or performance incidents. | Exasol benchmark reported falling query completion as nodes increased on TPC-H style joins. | medium | medium-high | low-moderate — ClickHouse markets JOIN improvements but independent critiques persist | medium-high |
| Hyperscaler execution dependence | ClickHouse Cloud distribution and reliability depend on successful execution across AWS, GCP, and Azure footprints. | Cloud page says the service is on all three major cloud marketplaces. | medium | medium | moderate | medium |
| Operational complexity migrates to customers at scale | Even with automation, committed-spend users still need assurance around SLOs, tenancy, and support quality. | SLA language is limited to select committed-spend contracts rather than the whole funnel. | medium | medium-high | moderate | medium-high |
Likelihood and severity are qualitative and tie directly to public uptime, advisory, and cloud-operations disclosures.
[CR008, CR009, CR014, CR015, CR022, CR025]7.4 Customer concentration, GTM quality, and monitoring triggers
The final risk bucket is not whether ClickHouse has demand; it is whether the demand converts into durable, diversified, and profitable cloud revenue. Public evidence clearly supports a developer-led funnel: free credits, local modes, embedded options, OSS community scale, and easy experimentation are all designed to maximize adoption. That is a strength for product distribution, but it can also mask weak cohort economics if a relatively small number of enterprise customers account for a disproportionate share of paid cloud spend. Public materials describe more than 2,000 customers and list several marquee accounts, yet they do not show top-customer share, NRR, gross margin, or cloud-versus-self-managed conversion. That omission matters because premium features such as HIPAA, PCI, and SLA commitments appear to sit mostly with enterprise buyers. Investors therefore need to monitor not just adoption velocity but mix quality: whether large accounts are expanding, whether uptime holds as more critical workloads land, whether cloud-only features convert without alienating the OSS base, and whether competition from Snowflake, Databricks, StarRocks, or DuckDB compresses pricing or pushes ClickHouse into a narrower set of use cases than the current valuation assumes.[CR004, CR005, CR006, CR007, CR018, CR019]
| Risk | Counterparty or dependency | Mechanism | Evidence | Severity | Mitigation / diligence ask |
|---|---|---|---|---|---|
| Under-disclosed customer concentration | Top cloud customers and cohorts | A few large committed-spend customers could dominate paid cloud economics even though headline customer count is broad. | Public disclosures show >2,000 customers and SLA language for committed-spend contracts but no NRR or top-account mix. | high | Ask for top-10 customer contribution, NRR, gross retention, and sector concentration by ARR |
| Developer-led funnel monetizes unevenly | Self-serve and OSS users | High trial volume and local usage may not translate cleanly into durable cloud expansion. | Free 30-day trial, $300 credits, OSS core, local modes, and community-first messaging. | medium-high | Request conversion by cohort, payback by acquisition channel, and cloud attach rate from self-managed users |
| Hyperscaler route-to-market dependence | AWS, GCP, Azure marketplaces | Marketplace reach helps distribution but introduces platform, margin, and operational dependence. | Cloud page says ClickHouse Cloud is available on all three major cloud marketplaces. | medium | Review net revenue after marketplace fees and any concentration by cloud provider |
| Premium compliance revenue concentration | Regulated enterprise buyers | HIPAA, PCI, and likely deeper support commitments sit with enterprise plans, increasing dependence on larger accounts. | Compliance docs gate HIPAA and PCI behind Enterprise plan and SLAs behind committed-spend contracts. | medium-high | Request ARR mix by plan tier, regulated vertical exposure, and churn history for enterprise cohorts |
| Named-customer halo overstates diversification | Anthropic, Tesla, Meta, Sony, Instacart, others | Marquee logos can prove relevance while still masking a narrow revenue base. | Press release lists large customers but no revenue split by account, segment, or geography. | medium-high | Request top-customer share, top-20 share, and AI versus non-AI revenue mix |
This register is intentionally focused on monetization quality rather than raw logo count or download velocity.
[CR003, CR004, CR006, CR008, CR012, CR019]| Risk | Monitorable trigger | Threshold / event | Action implication |
|---|---|---|---|
| Path-to-profitability at 2025 valuation | Disclosure of ARR, gross margin, burn, or operating leverage | No credible unit-economics disclosure emerges by the next major financing or public-market event | Treat valuation as narrative-led and demand a larger margin of safety |
| Snowflake / Databricks displacement | Large enterprise wins against incumbent cloud warehouses | ClickHouse keeps being adopted only as a sidecar or narrow workload engine instead of a primary platform | Haircut terminal share assumptions and cloud expansion expectations |
| StarRocks and DuckDB substitution | Customer workload mix and benchmark chatter | Join-heavy workloads shift to StarRocks or embedded analytics stay on DuckDB without cloud upgrade | Reduce confidence in broad workload expansion and cloud attach-rate assumptions |
| Cloud execution quality | Public uptime, incident cadence, and security advisories | Status reliability weakens further or patch-related issues keep recurring in visible customer environments | Increase execution discount and require stronger operational evidence |
| OSS governance stability | Roadmap clarity and community tone | Cloud-only differentiation widens while contributor frustration or fork rhetoric accelerates | Model higher ecosystem risk and slower community-led adoption |
| Customer concentration | Top-account mix and cohort retention | A small set of enterprise accounts drives most ARR or retention deteriorates after trial conversion | Lower revenue-quality assumptions and reassess valuation support |
Each trigger is designed to be monitorable from either management disclosure or direct diligence rather than from product narrative alone.
[CR001, CR023, CR024, CR034, CR035, CR037]ClickHouse’s commercial model depends on an upgrade path from open-source and local modes into premium cloud tiers without losing users to alternatives.
This map shows commercial dependence, not legal control; alternative paths can either feed or weaken the cloud upgrade funnel.
[CR006, CR017, CR018, CR022, CR030, CR036]7.5 Exhibits
08Valuation
8.1 Recommendation: track — the product and open-source funnel are real, but the Series C already prices in elite execution
ClickHouse has many of the ingredients investors want in data infrastructure: a strong open-source engine, real customer pull in AI and observability workloads, and a cloud product whose usage-based model can compound as workloads scale. The issue is price, not quality. Independent coverage put the May 2025 Series C at about $6.35 billion, while third-party ARR estimates for the business cluster around roughly $150 million to $185 million, with Sacra’s central estimate at $160 million. That means the round was struck at roughly 35x to 42x trailing ARR. That premium is materially above Snowflake and MongoDB in public markets and still richer than Databricks despite Databricks’ far larger revenue base, broader platform breadth, and disclosed retention profile. The right call is track with medium confidence and high valuation risk: stay close, but require fresh evidence that ClickHouse can rapidly convert open-source demand into durable cloud economics before underwriting the headline mark.[CV004, CV006, CV009, CV012, CV014, CV029]
| decision field | current view | decision implication |
|---|---|---|
| Recommendation | track | Stay engaged, but do not treat the May 2025 price as obviously cheap based on public evidence. |
| Confidence | medium | Growth and adoption are well supported; ARR quality, retention, and margins are not publicly validated. |
| Risk rating | high | Valuation risk is high because the Series C was struck at roughly 35x-42x trailing ARR. |
| Valuation stance | rich but defensible only on a forward view | The mark can work if ClickHouse compounds ARR rapidly into the mid-$200M to $300M range. |
| Underwrite off | forward ARR rather than trailing ARR | Use milestone-based underwriting instead of relying on the 2025 headline valuation alone. |
| Upgrade trigger | buy only after proof | Require disclosure on retention, margins, and cloud conversion before moving from track to buy. |
This table reflects an investment judgment, not a generic product-quality score. The core debate is how much execution is already priced into the Series C mark.
[CV009, CV012, CV014, CV029, CV043, CV044]| argument | current evidence | what would change the view |
|---|---|---|
| Open-source adoption creates a low-friction funnel | Sacra cites ~46K GitHub stars, strong developer adoption, and 250%+ cloud ARR growth. | Show cohort conversion from open-source to paid cloud and sustained expansion within large enterprise accounts. |
| AI and real-time analytics are genuine tailwinds | ClickHouse won AI-native and observability logos that need low-latency analytical queries. | Evidence of durable budget ownership and repeatable large-account deployment would strengthen conviction. |
| Counter: the public comp gap is very large | ClickHouse’s trailing multiple sits above Snowflake, MongoDB, SingleStore, and even Databricks. | The gap narrows only if ARR scales rapidly enough for the current mark to become a forward high-teens or low-20s multiple. |
| Counter: platform breadth trails Databricks | Databricks disclosed 20,000+ organizations and >140% retention, which ClickHouse has not matched publicly. | A broader enterprise feature set and better disclosure would justify a narrower discount to Databricks. |
| Counter: disclosure is thin | ARR is estimated, while gross margin, NRR, and financing terms remain undisclosed in retained public sources. | Audited financials or lender-grade KPI disclosure would materially improve underwriting confidence. |
The thesis is strongest when framed around distribution and category fit; the anti-thesis is strongest when framed around the gap between narrative quality and public disclosure quality.
[CV010, CV011, CV019, CV020, CV029, CV034]The recommendation is driven by a tension: exceptional open-source distribution and cloud growth on one side, and a peer-multiple gap plus disclosure gaps on the other.
This figure expresses underwriting logic, not a process flow inside the product.
[CV011, CV012, CV014, CV029, CV037, CV043]8.2 Series C pricing context: $6.35B was a forward-underwriting price, not a public-comparable price
The May 2025 round itself was unquestionably real: ClickHouse raised $350 million, added a broad syndicate of blue-chip investors, and layered in a $100 million credit facility. What is harder to prove from public evidence is whether the company had already earned a mid-single-digit-billion valuation on current scale. ClickHouse disclosed more than 300% growth and more than 2,000 customers, while Sacra estimated 2025 annualized revenue at about $160 million and cloud ARR growth above 250% entering 2026. Those are excellent operating signals. Even so, the implied trailing multiple remains near 40x, and the underwriting range requested by the market still lands in the mid-30s to low-40s even if you give the company the benefit of a broader $150 million to $185 million ARR range. That means investors were effectively paying today for tomorrow’s ARR, not for a mature, filing-backed revenue base.[CV001, CV002, CV003, CV004, CV009, CV010]
On a trailing basis, ClickHouse’s Series C mark sits far above public and open-source comps and still above Databricks’ richer private multiple.
Multiples are simplified EV or market-cap-to-revenue bridges using publicly available valuation and revenue figures; private-company enterprise values may differ from headline equity valuations.
[CV013, CV014, CV017, CV019, CV024, CV027]8.3 Comparable set: Databricks is the aspiration, Snowflake and MongoDB are the public reality, and SingleStore is the closer private floor
The comp set makes the valuation tension clear. Snowflake generated $4.68 billion of FY2026 revenue and traded around a 13x revenue multiple in late May 2026. MongoDB, the most useful public open-source benchmark, traded closer to 10x. Databricks still commanded roughly 24.8x revenue, but it did so on a disclosed $5.4 billion revenue run-rate, more than 20,000 organizations on the platform, and net retention above 140%. SingleStore, by contrast, reported ARR above $123 million with near-breakeven cash flow, yet its last known disclosed valuation was only about $1 billion, or roughly 8x ARR on that simplified bridge. ClickHouse’s own ~40x trailing multiple therefore sits above every named peer. The only intellectually honest reason to pay that premium is belief that open-source adoption and AI-era workload growth will let ClickHouse move quickly toward Databricks-like forward scale rather than settle into Snowflake-, MongoDB-, or SingleStore-like pricing bands.[CV015, CV017, CV018, CV019, CV020, CV021]
| comparable | 2026 scale metric | valuation / multiple | relevance | limitation |
|---|---|---|---|---|
| ClickHouse Series C | ~$150M-$185M ARR range; 2,000+ customers | $6.35B; ~35x-42x ARR | The exact underwriting question at issue. | Revenue range is estimated rather than audited. |
| Databricks | $5.4B revenue run-rate; >65% growth | $134B; ~24.8x revenue | Best private high-growth data and AI platform comp. | Broader product surface and much larger customer base. |
| Snowflake | $4.68B FY2026 revenue | ~$61.55B market cap; ~13.1x revenue | Clean public cloud-warehouse comp. | Mature public company with lower growth and different governance constraints. |
| MongoDB | $2.46B FY2026 revenue | ~$24.74B market cap; ~10.0x revenue | Best public open-source premium benchmark. | Different workload mix and less direct overlap in observability and analytics. |
| SingleStore | >$123M ARR; near-breakeven cash flow | ~$1B last known valuation; ~8.1x ARR | Closer real-time database comp and useful private floor. | Valuation point is older and less liquid than a live financing mark. |
This is the full comp set explicitly selected for this chapter: ClickHouse’s own round anchor, one scaled private platform comp, two public comps, and one closer real-time-database private comp.
[CV012, CV014, CV017, CV019, CV024, CV027]8.4 Scenario analysis: the mark works only if ClickHouse can move from ~40x trailing ARR to a lower forward multiple quickly
The key valuation question is not whether ClickHouse is good. It is whether the company can compound fast enough to make a $6.35 billion entry look reasonable in hindsight. On peer multiples, the required scale jump is large. At Snowflake’s public multiple, ClickHouse would need roughly $485 million of ARR or revenue. At MongoDB’s multiple, it would need roughly $632 million. Even a Databricks-like premium still demands about $256 million. That is why the base case should be framed on forward ARR, not trailing ARR. If ClickHouse can convert its open-source funnel into enterprise cloud ARR and cross roughly $280 million to $320 million in the next 12 to 18 months, the current mark starts to look defensible. If growth stalls, if enterprise readiness lags, or if public comps de-rate further, downside is severe because there is so much air between the current multiple and public trading bands.[CV010, CV013, CV017, CV019, CV027, CV030]
| scenario | core assumptions | valuation logic | value range (USD billions) | probability signal |
|---|---|---|---|---|
| Bear | Growth slows toward public-comp trajectories, enterprise gaps persist, and public data-infra multiples soften further. | Low-teens comp frame closer to public open-source and warehouse peers. | 3.0-4.0 | Most likely if ARR growth decelerates before ClickHouse proves retention and margin quality. |
| Base | ClickHouse converts open-source demand into enterprise cloud ARR and reaches roughly $280M-$320M ARR within 12-18 months. | High-teens to ~20x forward ARR bridge. | 5.0-6.5 | Requires continued hypergrowth plus credible evidence on ARR quality and enterprise readiness. |
| Bull | ARR reaches roughly $375M-$430M quickly, the AI/open-source premium holds, and enterprise feature gaps close. | 20x-22x forward ARR for a breakout infrastructure winner. | 7.5-9.5 | Needs elite execution and continued premium market appetite for AI data infrastructure. |
Ranges are scenario-based valuation bridges, not DCF outputs. The base case treats the Series C as a forward-underwritten price rather than a current-scale price.
[CV010, CV031, CV032, CV033, CV038, CV039]| trigger | threshold or event | thesis transmission | action implication |
|---|---|---|---|
| Growth proof breaks | Management data no longer supports a path to ~$300M ARR within 12-18 months. | The current mark loses its forward-ARR justification. | Re-underwrite toward public open-source and warehouse multiples. |
| Retention or margin proof is weak | NRR or gross margin disclosure shows materially weaker economics than premium peers. | Premium growth quality was overstated. | Do not pay up for the 2025 headline; require a lower entry or step aside. |
| Enterprise feature gap persists | Security, governance, or compliance capabilities remain subscale for large regulated workloads. | Open-source adoption fails to convert into large durable cloud contracts. | Cut the premium and extend product and customer diligence. |
| Public comps de-rate further | Snowflake or MongoDB style multiples fall materially below current 10x-13x levels. | Exit math compresses even if ClickHouse executes well operationally. | Lower target entry and assume more conservative exit outcomes. |
| Governance or cap-table surprise | Preferences, debt covenants, or other senior claims materially distort common-equivalent value. | Headline valuation overstates investable economics. | Treat the round as less attractive than the headline suggests until documents are reviewed. |
These triggers are designed to be monitorable. Each one translates directly into either multiple compression, slower ARR growth, or worse exit economics.
[CV031, CV032, CV033, CV039, CV044, CV045]The current mark sits near the top of the base case and requires very strong forward ARR growth to avoid slipping into the bear range.
Values are expressed in USD millions and represent scenario-based valuation outcomes rather than discounted cash-flow estimates.
[CV031, CV032, CV033, CV038, CV039, CV045]ClickHouse scores very well on market and product factors, but much worse on disclosure quality and current entry valuation.
Scores are 0-10 underwriting judgments synthesized from the public evidence and are meant for internal investment-committee discussion.
[CV009, CV011, CV029, CV037, CV043, CV044]8.5 Exit readiness and diligence asks: the missing metrics are exactly the ones needed to justify the premium
The evidence gap is not around product relevance; it is around economic proof. Public sources do not provide audited ClickHouse financials, net retention, gross margin, or the conversion quality from open-source users to paid cloud cohorts. Nor do retained public sources disclose liquidation preferences or cap-table details that could change the economics of the Series C headline. That leaves the round looking more like a conviction bet on category leadership than a price backed by investor-complete disclosure. For that reason, the chapter’s diligence posture should stay focused on the metrics that could move the recommendation. A buy call would require audited or management-backed disclosure on ARR quality, retention, margin structure, and enterprise conversion. Without those, the stronger argument is to track the company closely, keep price discipline, and underwrite only after the business proves it can grow into the multiple rather than merely narrate its way toward it.[CV037, CV043, CV044, CV045]
| topic | missing evidence | why it matters | owner or diligence path |
|---|---|---|---|
| ARR quality | Cloud versus self-managed mix, customer cohorts, and top-account concentration. | Tests whether the open-source funnel is converting efficiently into paid cloud ARR. | Request a KPI pack and revenue bridge from management. |
| Retention and margin | NRR, gross margin, contribution margin, and payback by major customer cohort. | Determines whether ClickHouse deserves a durable premium over public peers. | Request lender-grade operating metrics or audit support. |
| Cap table and terms | Liquidation preferences, participation, and debt covenants tied to the Series C or later financings. | Headline valuation may overstate common-equivalent value. | Review financing documents and waterfall models. |
| Enterprise readiness | Evidence on security, governance, compliance, and large regulated wins. | Required to convert open-source popularity into durable enterprise cloud spend. | Run product diligence plus customer reference calls. |
| Exit path | Current IPO versus strategic-sale plan and sensitivity to public comp moves. | A venture return depends on the future multiple floor, not the present narrative alone. | Ask management and bankers for current exit framing and scenario sensitivity. |
The missing evidence is concentrated in economics, governance, and exit math rather than in product relevance. That is exactly why the recommendation stays at track.
[CV037, CV043, CV044, CV045]8.6 Exhibits
Disclaimer
This report is a public-evidence diligence snapshot, not investment advice. Important financial, legal, technical, and contractual facts remain non-public and should be verified directly with management and primary documents before any investment decision.
Evidence index
| ID | Statement | Confidence | Sources |
|---|---|---|---|
| CO001 | Work on ClickHouse began inside Yandex in 2009 as an effort to run analytical queries on real-time, non-aggregated data. | High | SO001, SO021 |
| CO002 | ClickHouse entered production in 2012 to power Yandex.Metrica. | High | SO001, SO021 |
| CO003 | ClickHouse was released as open-source software under the Apache 2.0 license in 2016. | High | SO001, SO021 |
| CO004 | ClickHouse, Inc. was formed in August 2021 as a Delaware corporation separate from the earlier Yandex project. | High | SO021, SO022 |
| CO005 | Aaron Katz is a co-founder and the CEO of ClickHouse. | Medium | SO001, SO013 |
| CO006 | Alexey Milovidov is a co-founder, CTO, and the original creator of ClickHouse. | High | SO001, SO021 |
| CO007 | Yury Izrailevsky is a co-founder and president of ClickHouse. | Medium | SO001, SO020 |
| CO008 | ClickHouse is a fast, open-source, column-oriented database management system built for real-time analytics. | High | SO003, SO005 |
| CO009 | The company commercializes the open-source core primarily through ClickHouse Cloud and related managed real-time analytics services. | Medium | SO003, SO011, SO019 |
| CO010 | ClickHouse Cloud entered early access in 2022. | Medium | SO001, SO011 |
| CO011 | ClickHouse says it has employees in over 10 countries and operates with a distributed-team model. | High | SO001, SO002 |
| CO012 | ClickHouse opened European offices in Amsterdam in 2022. | Medium | SO001, SO023 |
| CO013 | PitchBook currently labels ClickHouse as headquartered in San Francisco, CA, confirming a San Francisco Bay Area headquarters identity. | Medium | SO024, SO001 |
| CO014 | Public location records differ on exact Bay Area labeling: Colorado registration lists Portola Valley as the principal address while Craft lists Palo Alto and Amsterdam office locations. | Medium | SO022, SO023, SO024 |
| CO015 | PitchBook lists ClickHouse with 531 total employees. | Medium | SO024 |
| CO016 | Tracxn reports ClickHouse had 569 employees as of April 2026, supporting a public 500-plus headcount range. | Medium | SO025 |
| CO017 | ClickHouse raised a $50 million Series A in August 2021 led by Index Ventures and Benchmark. | High | SO021, SO018 |
| CO018 | ClickHouse raised a $250 million Series B at a $2 billion valuation in October 2021. | High | SO004, SO018 |
| CO019 | Series B participants included Index Ventures, Benchmark, Lightspeed, Redpoint, Almaz Capital, Yandex, FirstMark, and Lead Edge alongside lead investors Coatue and Altimeter. | High | SO004, SO018 |
| CO020 | Lightspeed identifies ClickHouse as a 2021 Series B portfolio investment. | Medium | SO020, SO018 |
| CO021 | Mike Volpi currently serves on ClickHouse's board. | Medium | SO027 |
| CO022 | Peter Fenton currently serves on ClickHouse's board. | Medium | SO028 |
| CO023 | ClickHouse raised a $350 million Series C in May 2025 led by Khosla Ventures. | High | SO005, SO026 |
| CO024 | Series C participants included BOND, IVP, Battery Ventures, Bessemer Venture Partners, Benchmark, Coatue, Lightspeed, FirstMark, GIC, and Nebius. | High | SO005, SO006, SO018 |
| CO025 | ClickHouse also secured a $100 million credit facility led by Stifel and Goldman Sachs alongside the Series C. | High | SO005, SO008, SO026 |
| CO026 | Third-party coverage after the May 2025 round placed ClickHouse's valuation at roughly $6.35 billion to $6.4 billion. | Medium | SO026, SO012 |
| CO027 | Public financing releases put ClickHouse's total funding above $650 million after the May 2025 Series C close. | High | SO005, SO026 |
| CO028 | ClickHouse disclosed more than 2,000 customers by May 2025. | Medium | SO005, SO010 |
| CO029 | Series C communications said the company grew over 300% during the prior year. | Medium | SO005, SO009 |
| CO030 | Sacra estimated ClickHouse reached about $160 million in ARR by the end of 2025, growing 256% year over year. | Medium | SO011 |
| CO031 | PitchBook News reported ClickHouse hit $100 million in annualized revenue in the first half of 2025 and that recurring revenue had roughly doubled over the preceding six months. | Medium | SO019 |
| CO032 | Public company and media sources place Anthropic, Tesla, Cisco, Sony, and other large enterprises among the visible customer and use-case footprint around ClickHouse. | Medium | SO005, SO012 |
| CO033 | ClickHouse added Kevin Egan as CRO, Mariah Nagy as VP People, and Jimmy Sexton as CFO in 2025. | Medium | SO010 |
| CO034 | In March 2022 ClickHouse said it had no operations in Russia, no Russian investors, and no Russian members of its board. | Medium | SO002 |
| CO035 | ClickHouse said it accelerated relocation of the original Russian engineering team to Amsterdam after the invasion of Ukraine, and PitchBook later reported Nebius retained warrants but no equity stake. | Medium | SO002, SO019 |
| CO036 | JFrog disclosed seven RCE and DoS vulnerabilities in ClickHouse DBMS. | High | SO015, SO016 |
| CO037 | Ubuntu's July 2024 security notice said older ClickHouse builds had heap overflows and possible arbitrary code execution risks including CVE-2021-43305. | High | SO016, SO015 |
| CO038 | A 2024 bug, CVE-2024-22412, allowed query cache to bypass role-based access controls until patched. | Medium | SO017 |
| CO039 | Public sources show open-source adoption scaling from more than 20,000 GitHub stars in 2021 to about 46,000 stars by early 2026. | Medium | SO004, SO011 |
| CO040 | Major scale milestones included the 2022 Amsterdam-office and cloud launch phase, the March 2025 HyperDX acquisition, and the May 2025 OpenHouse conference plus Series C. | Medium | SO001, SO012, SO025 |
| CO041 | Public directories sometimes date ClickHouse to 2009 or 2012 because they track the project origin, but official corporate chronology distinguishes the 2021 company formation from the earlier Yandex project. | Medium | SO001, SO022, SO024, SO025 |
| CO042 | Index says Mike Volpi and Peter Fenton joined Aaron Katz in negotiating a majority-controlled independent spin-out with Alexey Milovidov's team relocating to Amsterdam. | Medium | SO021 |
| CO043 | The investor map spans founding investors Index and Benchmark, 2021 growth investors such as Coatue, Altimeter, Lightspeed, and Almaz, 2025 growth investors led by Khosla, and debt providers Stifel and Goldman Sachs. | Medium | SO004, SO005, SO018, SO020 |
| CM001 | ClickHouse describes itself as a fast open-source column-oriented database management system for real-time analytical reports using SQL queries. | Medium | SM001 |
| CM002 | DB-Engines describes ClickHouse as a high-performance column-oriented SQL DBMS for OLAP that is available as both open-source software and a cloud offering. | Medium | SM006 |
| CM003 | ClickHouse's official product positioning spans real-time analytics, observability, data warehousing, and ML or GenAI workloads rather than one narrow software category. | High | SM001, SM003 |
| CM004 | The public GitHub repository describes ClickHouse as a real-time analytics database management system, reinforcing the project's developer-facing identity. | Medium | SM005 |
| CM005 | ClickHouse documentation lists five deployment modes: ClickHouse Server, ClickHouse Cloud, ClickHouse CLI, clickhouse-local, and chDB. | Medium | SM002 |
| CM006 | ClickHouse Server can be deployed locally, on-premises, or on major cloud providers including AWS, GCP, and Azure. | Medium | SM002 |
| CM007 | ClickHouse Cloud is the fully managed ClickHouse deployment mode that removes operational tasks such as updates, backups, scaling, and security patching. | High | SM002, SM003 |
| CM008 | ClickHouse Cloud is available on all three major cloud marketplaces, giving buyers a managed service option across AWS, GCP, and Azure. | High | SM002, SM003 |
| CM009 | ClickHouse Cloud markets compute-storage separation, pay-for-use compute, and lower replica overhead as core cost-efficiency features. | Medium | SM003 |
| CM010 | ClickHouse pricing emphasizes automatic scaling up and down, scaling unused resources to zero, and separate storage and compute billing. | Medium | SM009 |
| CM011 | ClickHouse's real-time analytics page emphasizes best-in-class query performance, continuous ingest, high query concurrency, and integration with common analytics tools. | Medium | SM010 |
| CM012 | ClickHouse's data warehousing page positions the product as a real-time data warehouse for business intelligence with faster queries at a fraction of the cost of traditional warehouse stacks. | Medium | SM011 |
| CM013 | ClickStack positions ClickHouse as an OpenTelemetry-native observability stack for logs, metrics, traces, session replays, and errors. | Medium | SM004 |
| CM014 | ClickStack claims 10-100x cost savings and sub-second queries on high-cardinality telemetry, directly targeting observability storage-cost pain. | Medium | SM004 |
| CM015 | ClickStack also offers a managed deployment path on ClickHouse Cloud for buyers who want observability without self-managing infrastructure. | Medium | SM004, SM003 |
| CM016 | ClickHouse's community page reports 12k+ Slack members, 2.9k+ contributors, 29k+ pull requests, 796 releases, and 47.6k+ GitHub stars. | High | SM001, SM007 |
| CM017 | ClickHouse's official adopters page documents a broad set of companies using ClickHouse and publishing success stories, supporting cross-vertical market applicability. | Medium | SM008 |
| CM018 | Altinity describes itself as the second-largest contributor to ClickHouse and highlights ecosystem tools such as the Kubernetes operator, clickhouse-backup, and a Grafana plugin, indicating non-vendor ecosystem depth around the project. | Medium | SM027 |
| CM019 | Mordor Intelligence estimates the cloud data warehouse market at USD 14.94 billion in 2026 and USD 49.12 billion by 2031 at a 26.86% CAGR. | Medium | SM012 |
| CM020 | Research and Markets values the cloud data warehouse market at USD 14.53 billion in 2026 and USD 31.7 billion by 2030 at a 21.5% CAGR. | Medium | SM013 |
| CM021 | Research and Markets identifies artificial intelligence, compute-storage separation, real-time data processing, and predictive or operational analytics as major cloud data warehouse trends. | Medium | SM013 |
| CM022 | MarketsandMarkets frames the cloud data warehouse market by application, vertical, deployment model, and type, corroborating that buyer budgets are segmented rather than monolithic. | Low | SM014 |
| CM023 | IndustryARC projects the cloud data warehouse market to reach $39.1 billion by 2026 at a 31.4% CAGR, a much more aggressive estimate than Mordor or Research and Markets. | Low | SM015 |
| CM024 | IndustryARC says large enterprises are the largest current cloud data warehouse buyers, IT and telecom is the highest-growth application segment, and North America holds a 41.5% share. | Medium | SM015 |
| CM025 | Grand View Research sizes the streaming analytics market at USD 23.4 billion in 2023 and USD 128.4 billion by 2030 at a 28.3% CAGR. | Medium | SM016 |
| CM026 | Grand View Research attributes streaming analytics growth to real-time forecasting, digitalization, and the spread of big data, IoT, and AI. | Medium | SM016 |
| CM027 | Grand View Research reports that hosted deployment held 51.7% of streaming analytics revenue in 2023, BSFI held 23.8%, fraud detection led applications at 18.8%, North America held 38.0% share, and Asia Pacific was the fastest-growing region at 32.0% CAGR. | Medium | SM016 |
| CM028 | Grand View Research estimates the observability tools and platforms market at USD 2.71 billion in 2023 and USD 5.40 billion by 2030 at a 10.7% CAGR. | Medium | SM017 |
| CM029 | Grand View Research says cloud deployment and large enterprises were the largest current observability segments, and that microservices, containers, and cloud-native complexity are major demand drivers. | Medium | SM017 |
| CM030 | MarketsandMarkets estimates the observability tools and platforms market at USD 2.4 billion in 2023 and USD 4.1 billion by 2028 at an 11.7% CAGR. | Medium | SM018 |
| CM031 | Mordor Intelligence estimates the observability market at USD 3.35 billion in 2026 and USD 6.93 billion by 2031 at a 15.62% CAGR. | Medium | SM019 |
| CM032 | BigQuery positions itself as a fully managed and completely serverless enterprise data warehouse with real-time analytics, built-in AI, vector and hybrid search, and decoupled storage and compute. | Medium | SM020 |
| CM033 | BigQuery pricing includes on-demand query pricing starting at USD 6.25 per TiB scanned and slot-based editions, showing that incumbent warehouse alternatives are sold on explicit usage economics rather than only seat-based pricing. | Medium | SM020 |
| CM034 | Datadog markets a unified observability platform that aggregates logs, metrics, traces, and real-time dashboards across modern infrastructure. | Medium | SM021 |
| CM035 | Datadog pricing separates ingest, indexing, flex storage, archiving, and AI or LLM observability products, demonstrating that observability buyers actively optimize retention tiers and telemetry cost. | Medium | SM022 |
| CM036 | Elastic positions observability as an AI-powered, OpenTelemetry-first platform that unifies logs, metrics, and traces in one system. | Medium | SM023 |
| CM037 | Elastic claims up to 65% storage reduction for logs, up to 50% TCO reduction for long-term log retention, and 40% better latency since January 2026, reinforcing how much observability deals turn on efficiency. | Medium | SM023 |
| CM038 | AWS OpenSearch Service combines managed and serverless deployment for search, observability, and vector database workloads, including log analytics, generative AI, and RAG use cases. | Medium | SM024 |
| CM039 | Grafana predicts that in 2026 unified observability becomes the default operating model, data value overtakes data volume, AI shifts from copilot to collaborator, and OpenTelemetry becomes the default standard. | Medium | SM025 |
| CM040 | IBM argues that 2026 observability strategy will center on AI-driven intelligence, cost management, and compatibility with open standards such as OpenTelemetry, Prometheus, and Grafana. | Medium | SM026 |
| CM041 | The most conservative current TAM floor relevant to ClickHouse already exceeds $10 billion because independent 2026 cloud data warehouse estimates cluster around $14.5-$14.9 billion. | Medium | SM012, SM013 |
| CM042 | No clean public source isolates the narrower market for real-time columnar OLAP databases specifically; warehouse, streaming analytics, and observability estimates overlap and should not be summed. | Medium | SM012, SM013, SM016, SM017, SM019 |
| CM043 | ClickHouse's best market description is analytical data infrastructure spanning warehouse, event analytics, and observability workloads rather than a single software line item. | Medium | SM001, SM003, SM004, SM011 |
| CM044 | BigQuery, Datadog, Elastic, and AWS OpenSearch all show that incumbent substitutes increasingly bundle AI, observability, and managed operations into integrated platforms, which raises switching costs for ClickHouse deals. | Medium | SM020, SM021, SM023, SM024 |
| CM045 | The natural ClickHouse adoption path splits between self-managed control and managed cloud convenience, letting the company sell to both sovereignty-sensitive and operations-sensitive buyers. | Medium | SM002, SM003, SM004, SM009 |
| CP001 | ClickHouse markets itself as the fastest open-source analytical database. | Medium | SP001 |
| CP002 | ClickHouse says it supports data warehousing, real-time analytics, observability, and ML or GenAI workloads in one engine. | Medium | SP001 |
| CP003 | ClickHouse Server can be run locally, in major public clouds, or on on-premises hardware, while the same engine also underpins ClickHouse Cloud. | Medium | SP003 |
| CP004 | ClickHouse Cloud is a fully managed service available on the three major cloud marketplaces. | Medium | SP004 |
| CP005 | ClickHouse public pricing emphasizes separate compute and storage, autoscaling, and scale-to-zero economics instead of fixed always-on capacity. | High | SP002, SP004 |
| CP006 | ClickHouse raised a $350 million Series C in May 2025, bringing total funding to more than $650 million. | Medium | SP006 |
| CP007 | ClickHouse said in May 2025 that it served more than 2,000 customers and had grown more than 300% over the prior year. | Medium | SP006 |
| CP008 | ClickHouse positions its benchmark program as public and reproducible, with head-to-head cost and performance comparisons against other cloud data platforms. | Medium | SP007 |
| CP009 | DB-Engines describes ClickHouse as a high-performance column-oriented SQL OLAP system that is available both as open-source software and as a cloud offering. | High | SP001, SP008 |
| CP010 | ClickHouse, Inc. incorporated in 2021 around a project that had already been open-sourced in 2016, so the commercial company is younger than the software community around it. | Medium | SP005 |
| CP011 | Snowflake presents itself as a fully managed multi-cloud service with cross-region operation and built-in governance and security features. | Medium | SP009 |
| CP012 | Snowflake cost architecture is broken into compute, storage, and data-transfer charges, with compute consumed as Snowflake credits. | High | SP011, SP012 |
| CP013 | Snowflake warehouses can start and stop automatically, resize up or down, and use per-second billing with a 60-second minimum each time a warehouse starts. | High | SP011, SP012 |
| CP014 | Snowflake disclosed $9.77 billion of remaining performance obligations, 790 Forbes Global 2000 customers, and 733 $1 million-plus customers as of January 31, 2026. | Medium | SP010 |
| CP015 | Databricks describes its lakehouse as one architecture for integration, storage, processing, governance, sharing, analytics, and AI across major clouds. | Medium | SP014 |
| CP016 | Databricks pricing materials publish undiscounted list prices and SKU groups rather than a single simple platform sticker price. | Medium | SP015 |
| CP017 | Databricks frames the lakehouse as an open architecture that combines the best elements of data lakes and data warehouses on open formats. | Medium | SP017 |
| CP018 | Databricks says more than 20,000 organizations, including 70% of the Fortune 500, rely on its platform and that it works with more than 1,200 global partners. | Medium | SP016 |
| CP019 | BigQuery positions itself as an autonomous data-to-AI platform with built-in predictive analytics, generative-AI functions, graph analysis, and vector or hybrid search. | Medium | SP018 |
| CP020 | BigQuery offers a free tier and a serverless pricing model whose primary components are compute and storage, with slot reservations for committed capacity. | High | SP018, SP019 |
| CP021 | Amazon Redshift is positioned as a cloud data warehouse for analytics and agentic-AI workloads, with zero-ETL integrations and a lakehouse path tied to AWS services. | Medium | SP020 |
| CP022 | Redshift public pricing starts at $0.543 per hour for provisioned deployment and $1.50 per hour for serverless deployment, with per-second billing and reservation savings. | Medium | SP021 |
| CP023 | Amazon Athena is positioned as a serverless SQL and Spark analytics layer over S3 and other cloud or on-premises data stores with support for open formats. | Medium | SP022 |
| CP024 | Athena pricing is framed as pay for data processed or compute used, which makes it attractive for intermittent AWS-native analytics use cases. | Medium | SP023 |
| CP025 | DuckDB is an embedded in-process OLAP database with no separate server process, strong portability, vectorized execution, and an MIT license. | Medium | SP024 |
| CP026 | DuckDB is optimized for local or embedded analytical workflows rather than a vendor-managed enterprise control plane. | Medium | SP024 |
| CP027 | StarRocks markets itself as one engine for real-time, lakehouse, and AI analytics with consistent performance at scale. | Medium | SP025 |
| CP028 | Apache Druid positions itself as a high-performance real-time analytics database that supports sub-second queries on streaming and batch data at scale. | Medium | SP026 |
| CP029 | Imply Enterprise is the commercial distribution of Druid and can be deployed on any cloud with management and support tooling. | Medium | SP027 |
| CP030 | Imply Polaris exposes public starter and standard pricing, beginning at $100 per month and $600 per month respectively, which is unusual price transparency for a real-time analytics DBaaS. | Medium | SP028 |
| CP031 | SingleStore Helios is a cloud database service for real-time transactional, analytical, and RAG-style workloads with separate storage and compute. | Medium | SP029 |
| CP032 | SingleStore public pricing is usage-based and illustrated with credits-per-hour and storage-charge examples rather than one fixed enterprise subscription. | Medium | SP030 |
| CP033 | SingleStore also supports self-managed deployment on bare metal, virtual machines, cloud hosts, Docker, and Kubernetes. | Medium | SP032 |
| CP034 | ClickHouse offers more deployment sovereignty than Snowflake, BigQuery, or Athena because it can be self-managed or consumed as a managed cloud service on top of the same analytical engine. | Medium | SP001, SP003, SP009, SP018, SP022 |
| CP035 | Hyperscaler-owned products such as BigQuery, Redshift, and Athena have materially stronger procurement and bundle leverage than ClickHouse because they ride Google Cloud and AWS account control, adjacent services, and annual-report-scale parents. | Medium | SP018, SP020, SP022, SP033, SP034 |
| CP036 | Snowflake and Databricks are the closest broad-platform competitors to ClickHouse because both pair analytics with wider governance or AI suites, while ClickHouse stays more concentrated on high-speed analytical serving workloads. | Medium | SP001, SP009, SP014 |
| CP037 | DuckDB and Druid are narrower substitutes than Snowflake or Databricks: DuckDB focuses on embedded local analytics and Druid focuses on streaming-first real-time analytics. | Medium | SP024, SP026 |
| CP038 | StarRocks and SingleStore overlap with ClickHouse on low-latency analytical serving, but StarRocks leans into lakehouse and AI SQL while SingleStore leans into HTAP-style application databases. | Medium | SP025, SP029, SP031 |
| CP039 | ClickHouse pricing is easier to reason about than Databricks SKU sheets, but AWS, Athena, Imply, and SingleStore publish more concrete public starting prices than ClickHouse does. | Medium | SP002, SP015, SP021, SP023, SP028, SP030 |
| CP040 | Snowflake pricing mechanics are more explicit around credits and warehouse sizes than ClickHouse pricing, which emphasizes autoscaling, compute-storage separation, and scale-to-zero behavior. | Medium | SP002, SP011, SP012 |
| CP041 | ClickHouse has a strong developer-facing open-source posture, while Snowflake, BigQuery, Redshift, Athena, and SingleStore present primarily as proprietary services or products. | Medium | SP001, SP008, SP009, SP018, SP020, SP022, SP029 |
| CP042 | BigQuery, Redshift, and Athena create the most immediate GTM and bundle risk to ClickHouse because they can be sold as one more service inside existing hyperscaler estates. | Medium | SP018, SP020, SP022, SP033, SP034 |
| CP043 | ClickHouse is best aligned to speed-sensitive analytical serving, observability, and real-time warehousing workloads rather than office-suite BI bundling or embedded local analytics. | Medium | SP001, SP004, SP006, SP024 |
| CP044 | ClickHouse is most vulnerable where buyers prioritize a broader governed data-and-AI suite from Snowflake or Databricks or default to native hyperscaler procurement. | Medium | SP009, SP014, SP018, SP020 |
| CP045 | Multi-homing is rational in this category because Snowflake warehouses, BigQuery serverless scans and slots, Redshift or Athena AWS-native workflows, DuckDB local analysis, and ClickHouse real-time serving optimize different jobs. | Medium | SP002, SP011, SP019, SP021, SP023, SP024 |
| CP046 | ClickHouse has enough funding and customer proof to matter globally, but its commercial reach is still smaller than Snowflake’s public scale and Databricks’ customer-plus-partner footprint. | Medium | SP006, SP010, SP016 |
| CP047 | ClickHouse’s open-source and self-hostable posture is a real differentiator for sovereignty-minded buyers, but the same openness means it must continue winning on performance and developer love rather than pure lock-in. | Medium | SP001, SP003, SP024, SP026 |
| CP048 | Among the named alternatives, ClickHouse, DuckDB, and Apache Druid most clearly surface as open-source projects in the reviewed materials, while Databricks emphasizes open formats more than an open-source core. | Medium | SP001, SP017, SP024, SP026 |
| CP049 | SingleStore and ClickHouse both sell separated storage and compute in cloud form, but SingleStore packages that around mixed transactional and analytical application needs while ClickHouse remains analytics-centric. | Medium | SP004, SP029, SP032 |
| CP050 | The main competitive risk to ClickHouse is not one knockout rival but a segmented market where Snowflake and Databricks win breadth, hyperscalers win procurement, and DuckDB, Druid, StarRocks, or SingleStore win narrower use cases. | Medium | SP009, SP014, SP018, SP020, SP022, SP024, SP025, SP026, SP029 |
| CP051 | ClickHouse Cloud’s architecture emphasizes object-backed parallel replicas and separate compute layers for read and write workloads. | Medium | SP004 |
| CP052 | Snowflake highlights encryption, RBAC, network policies, MFA, masking, and Horizon Catalog as key parts of its trust and governance posture. | Medium | SP009 |
| CP053 | Redshift claims up to 2.2x better price-performance and 7x better throughput than other cloud data warehouses, which shows how aggressively incumbents market analytical efficiency. | Medium | SP020 |
| CP054 | Druid says its architecture can support from hundreds to one hundred thousand queries per second, underscoring its relevance for high-concurrency real-time analytics. | Medium | SP026 |
| CI001 | ClickHouse raised $250 million in Series B at a $2 billion valuation on October 28, 2021. | High | SI010, SI012 |
| CI002 | The 2021 Series B followed an earlier roughly $50 million Series A raised in August 2021. | High | SI010, SI012 |
| CI003 | ClickHouse raised $350 million in Series C financing in May 2025 and said Khosla Ventures led the round. | High | SI005, SI006, SI013 |
| CI004 | The May 2025 Series C brought ClickHouse total disclosed funding to more than $650 million. | Medium | SI005, SI006, SI007 |
| CI005 | ClickHouse disclosed a $100 million credit facility alongside the Series C financing. | High | SI005, SI006, SI013 |
| CI006 | January 2026 reporting tied ClickHouse to a $15 billion valuation, roughly 2.5 times the $6.35 billion valuation reported for May 2025. | High | SI008, SI009 |
| CI007 | Company-linked Series C materials said ClickHouse grew over 300% during the year before the May 2025 financing announcement. | Medium | SI005, SI006 |
| CI008 | Company-linked May 2025 materials said ClickHouse served more than 2,000 customers. | Medium | SI005, SI006 |
| CI009 | By October 2025, ClickHouse said it had more than quadrupled ARR over the prior year while still exceeding 2,000 customers. | Medium | SI004, SI020 |
| CI010 | TechCrunch and Sacra both indicated ClickHouse Cloud ARR was growing more than 250% year over year going into 2026. | Medium | SI008, SI009 |
| CI011 | Sacra estimated ClickHouse reached about $160 million in annualized revenue in 2025, up 256% from a $45 million exit rate at the end of 2024. | Medium | SI009 |
| CI012 | A cautious public range for 2025 ARR or annualized revenue is roughly $150 million to $200 million. | Medium | SI008, SI009 |
| CI013 | ClickHouse monetizes primarily through managed cloud services while the core database remains open source and free to use. | High | SI009, SI010, SI012 |
| CI014 | ClickHouse Cloud entered public beta on AWS in October 2022. | High | SI011, SI014, SI015 |
| CI015 | ClickHouse Cloud became generally available on December 6, 2022. | High | SI003, SI012 |
| CI016 | The December 2022 GA release extended the free trial to 30 days, added low-monthly-spend Development Services, reduced production pricing, and improved compute metering. | High | SI003, SI012 |
| CI017 | Current ClickHouse public pages describe usage-based pricing with separate compute and storage scaling and pay-for-use economics. | High | SI001, SI002, SI009 |
| CI018 | TechCrunch described ClickHouse Cloud as a PLG motion where users can start with free credits and move to transparent consumption-based monthly billing. | High | SI001, SI012 |
| CI019 | m3ter said ClickHouse simplified beta pricing from read and write units to consumed storage plus compute before GA. | High | SI014, SI003 |
| CI020 | ClickHouse publicly markets monetization across real-time analytics, data warehousing, observability, and AI-powered data applications. | High | SI001, SI023, SI024, SI025 |
| CI021 | ClickPipes and other managed integrations create monetizable adjacencies inside ClickHouse Cloud beyond core query serving. | Medium | SI001, SI009, SI021 |
| CI022 | Dedicated and Bring-Your-Own-Cloud deployment options imply higher-ACV enterprise motions than the open-source core alone. | Medium | SI001, SI009 |
| CI023 | ClickHouse emphasizes compute-storage separation and autoscaling as cost-efficiency drivers relative to self-managed deployments. | High | SI001, SI009, SI012 |
| CI024 | Public sources do not disclose realized enterprise pricing, discount levels, or the actual mix between cloud, support, and adjacent products. | High | SI001, SI002, SI009 |
| CI025 | Public sources do not disclose ClickHouse gross margin, net revenue retention, or churn. | Medium | SI001, SI009 |
| CI026 | Public sources do not disclose cash on hand, monthly burn, or remaining runway. | High | SI005, SI009, SI013 |
| CI027 | Public sources disclose the existence of a $100 million facility but not the draw, covenant, rate, or maturity details needed for full debt underwriting. | High | SI005, SI013 |
| CI028 | ClickHouse said in 2025 that it hired Jimmy Sexton as CFO, suggesting a more mature finance function. | Medium | SI004 |
| CI029 | AI-native customers are central to the current ClickHouse Cloud growth narrative. | Medium | SI004, SI005, SI008 |
| CI030 | ClickStack extends ClickHouse monetization into managed observability workloads on the cloud platform. | High | SI001, SI024 |
| CI031 | Real-time analytics and data warehousing remain the clearest publicly marketed workload categories and likely core revenue drivers. | High | SI001, SI023, SI025 |
| CI032 | The free open-source core functions as top-of-funnel freemium acquisition for the managed cloud offering rather than a separately disclosed revenue line. | Medium | SI009, SI010, SI012 |
| CI033 | By October 2021 ClickHouse pointed to more than 20,000 GitHub stars and 800-plus contributors, indicating a large open-source community funnel. | Medium | SI010 |
| CI034 | TechCrunch reported that early cloud traction already exceeded 100 customers around the December 2022 GA launch. | Medium | SI012 |
| CI035 | Snowflake, MongoDB, Confluent, and Elastic all show current annual report cycles through SEC EDGAR, offering public financial transparency that ClickHouse does not match as a private company. | High | SI016, SI017, SI018, SI019 |
| CI036 | Those public-company filing trails provide a more transparent benchmark for discussing category economics and risk than ClickHouse currently offers. | Medium | SI016, SI017, SI018, SI019 |
| CI037 | ClickHouse faces pricing pressure because buyers can compare managed cloud against both self-hosted open source and incumbent analytics platforms. | Medium | SI009, SI016 |
| CI038 | Sacra highlighted a downside case in which incumbent platforms such as Snowflake and BigQuery narrow ClickHouse performance differentiation. | Medium | SI009 |
| CI039 | The disclosed funding stack shows strong capital access, but the absence of current liquidity data prevents a firm runway view. | Medium | SI005, SI008, SI009, SI013 |
| CI040 | Series C proceeds were earmarked for product development, global expansion, and AI-native customer partnerships. | High | SI005, SI006, SI013 |
| CI041 | Current ClickHouse pages still frame free-trial credits and pay-for-use economics as customer-acquisition levers in 2026. | High | SI001, SI023, SI025 |
| CI042 | As of the 2026 run date, the best public picture is strong growth and fundraising paired with opaque unit economics and liquidity. | Medium | SI008, SI009, SI013 |
| CI043 | The October 2025 Series C extension indicates investors were still willing to add capital after the initial May 2025 round. | Medium | SI004, SI013, SI020 |
| CI044 | Expansion across AWS, GCP, and Azure marketplaces broadens enterprise distribution, but public sources still do not disclose CAC or payback. | Medium | SI001, SI009 |
| CE001 | ClickHouse is positioned publicly as both an open-source column-oriented OLAP DBMS and a managed cloud offering. | Medium | SE002, SE016, SE019 |
| CE002 | ClickHouse Cloud is a fully managed service where infrastructure, maintenance, scaling, and operations are handled by ClickHouse. | Medium | SE002 |
| CE003 | ClickHouse Cloud is presented as available on all three major cloud marketplaces/providers. | Medium | SE001, SE021 |
| CE004 | ClickHouse Cloud supports centralized marketplace billing across AWS, Azure, and GCP subscriptions. | Medium | SE010 |
| CE005 | The 2026 cloud changelog names AWS Mexico, Azure Australia East, and GCP London as supported ClickHouse Cloud regions or region additions. | Medium | SE010 |
| CE006 | The cloud overview describes ClickHouse Cloud as providing serverless operations, autoscaling, backups, replication, and high availability. | Medium | SE002 |
| CE007 | ClickHouse says its cloud architecture separates storage and compute with pay-for-use compute scaling. | Medium | SE001 |
| CE008 | ClickHouse Cloud uses object-backed parallel replicas in a shared-nothing architecture to reduce storage duplication and network overhead. | Medium | SE001 |
| CE009 | The Shared database engine works with Shared Catalog to manage databases whose tables use stateless engines such as SharedMergeTree. | Medium | SE006 |
| CE010 | The Shared database engine removes local-disk dependency by storing metadata in a central versioned state that compute nodes fetch on startup. | Medium | SE006 |
| CE011 | The VLDB architecture overview says ClickHouse uses a vectorized query execution engine with optional code compilation. | Medium | SE003 |
| CE012 | The development architecture page says ClickHouse processes data as arrays or chunks and dispatches operations on arrays whenever possible. | Medium | SE004 |
| CE013 | ClickHouse documents its engine as layered into query processing, storage, integration, and access/control components. | Medium | SE003 |
| CE014 | MergeTree-family engines are designed for high ingest rates and large data volumes by creating parts that are merged in the background. | Medium | SE005 |
| CE015 | MergeTree primary keys index blocks of rows called granules rather than individual rows, and the default index granularity is 8192. | Medium | SE005 |
| CE016 | ClickHouse documents sparse primary indexes as memory-efficient for very large tables but notes that a range read can still pull up to index_granularity*2 extra rows per data block. | Medium | SE005 |
| CE017 | ClickHouse explicitly links better compression to sorting data by a consistent primary key. | Medium | SE005 |
| CE018 | The architecture overview says ClickHouse integrates external databases, Kafka and RabbitMQ, lakehouse table formats, and object storage through its integration layer. | Medium | SE003 |
| CE019 | ClickPipes is described as a Cloud-only managed ingestion engine for Kafka, S3, PostgreSQL, MongoDB, GCS, MySQL, and other sources. | Medium | SE001 |
| CE020 | The Kafka engine documentation recommends ClickPipes on ClickHouse Cloud for private networking, independent scaling, and monitoring of Kafka ingestion. | Medium | SE007 |
| CE021 | The Kafka table engine supports configurable consumers, security protocols, schema-aware parsing, and materialized-view based streaming pipelines. | Medium | SE007 |
| CE022 | The dbt-clickhouse adapter supports table, view, incremental, materialized-view, tests, snapshots, and seeds workflows. | Medium | SE008 |
| CE023 | The dbt adapter also exposes ClickHouse-specific codecs, TTLs, indexes, and projections and documents CI/CD patterns for staging and production. | Medium | SE008 |
| CE024 | ClickHouse organizes integrations into core, partner, and community tiers rather than presenting every connector as first-party. | Medium | SE009 |
| CE025 | The Microsoft Power Query connector is GA and supports Power BI semantic models, Dataflows, and Fabric Dataflow Gen2. | Medium | SE020 |
| CE026 | ClickHouse’s Azure GA announcement highlights turnkey integrations with Power BI, Azure Event Hubs, and Azure Blob Storage. | Medium | SE021 |
| CE027 | The Cloudflare Logpush integration guide says ClickPipes can ingest from S3 with exactly-once semantics and replay capability. | Medium | SE011 |
| CE028 | The cloud overview lists GDPR, HIPAA, ISO 27001, ISO 27001 SoA, PCI DSS, and SOC 2 among ClickHouse Cloud compliance programs. | Medium | SE002 |
| CE029 | The cloud overview lists SSO, multi-factor authentication, RBAC, Private Link, Private Service Connect, IP filtering, and CMEK as cloud security controls. | Medium | SE002 |
| CE030 | The Azure GA announcement says ClickHouse Cloud is built with network isolation, traffic encryption, and multi-availability-zone replication. | Medium | SE021 |
| CE031 | The 2026 changelog shows organization spend alerts, dual-window autoscaling, and primary-service idling reaching GA or active rollout during 2026. | Medium | SE010 |
| CE032 | Index sharding entered private preview in 2026 to distribute index analysis across replicas, cut per-replica memory, and improve query performance. | Medium | SE010 |
| CE033 | The cloud changelog says BYOC on GCP became GA and ClickPipes reached AWS region parity in 2026. | Medium | SE010 |
| CE034 | The public GitHub repo showed roughly 47.6k stars, 8.4k forks, and 796 releases on the access date. | Medium | SE014 |
| CE035 | The documentation repo page exposes active maintenance metadata, watchers, and multi-language docs contributions. | Medium | SE015 |
| CE036 | PyPI listed clickhouse-connect 1.1.0, released on 2026-05-26, with Python 3.10-3.14 support plus async, SQLAlchemy, Pandas, and Superset capabilities. | Medium | SE017 |
| CE037 | The npm package page showed @clickhouse/client 1.11.1 with 503,589 weekly downloads and documented streaming support for Node.js and browser clients. | Medium | SE018 |
| CE038 | Docker Hub showed more than 100 million pulls for clickhouse/clickhouse-server and documented the HTTP 8123 and native 9000 interfaces. | Medium | SE019 |
| CE039 | DB-Engines describes ClickHouse as a high-performance column-oriented SQL OLAP DBMS available as open source and as a cloud offering. | Medium | SE016 |
| CE040 | DB-Engines lists access methods including HTTP REST, JDBC, ODBC, and PostgreSQL/MySQL-compatible wire protocols. | Medium | SE016 |
| CE041 | A TrustRadius reviewer credits MergeTree, primary-key data skipping, and compression for strong query performance and cites one workflow improving from more than two minutes to under one second. | Medium | SE022 |
| CE042 | The same TrustRadius review flags gaps in SQL console query-plan UX, cloud role granularity, and some SSO IdP compatibility. | Medium | SE022 |
| CE043 | ClickHouse’s Cloudflare meetup recap says Cloudflare ran ClickHouse in production by the end of 2016 and later exceeded 1,000 active replicas. | Medium | SE013 |
| CE044 | The Cloudflare recap says the deployment processed hundreds of millions of inserted rows per second and maintained 184 dictionaries. | Medium | SE013 |
| CE045 | Cloudflare said moving dictionaries from hashed to hashed-array layouts reduced memory footprints by more than 4x. | Medium | SE013 |
| CE046 | HypeQuery argues that scaled ClickHouse deployments commonly add query translation and semantic layers because optimized schemas are too cognitively expensive for broad self-service use. | Medium | SE023 |
| CE047 | HypeQuery cites Uber, Cloudflare, Instacart, Microsoft, GitLab, Lyft, and Contentsquare as converging on similar abstraction stacks above ClickHouse. | Medium | SE023 |
| CE048 | ClickHouse’s adopters page shows production use across observability, SEO, blockchain, cloud data platforms, and security-related workloads. | Medium | SE012 |
| CE049 | ClickHouse’s user stories and analytics video position the product in observability, customer-facing analytics, and other large-scale analytics workflows. | Medium | SE024, SE025 |
| CE050 | Microsoft’s connector documentation requires an ODBC driver and, for cloud service scenarios, an on-premises data gateway to bridge ClickHouse into Power BI services. | Medium | SE020 |
| CE051 | The cloud overview positions the SQL console and clickhousectl CLI as part of the managed cloud operating surface. | Medium | SE002 |
| CE052 | ClickHouse Cloud documents compute-compute separation, with independent compute layers for read and write workloads. | Medium | SE001 |
| CU001 | ClickHouse's current public proof base spans observability, product analytics, AI and LLM operations, fintech, education, mobility, and industrial analytics rather than a single vertical. | High | SU001, SU032 |
| CU002 | The public customer-story set visibly over-indexes toward observability and real-time analytics workloads rather than general-purpose enterprise warehousing. | Medium | SU001 |
| CU003 | In most public ClickHouse deployments, engineering, platform, SRE, or data teams are the buyers and operators, while broader end users consume governed dashboards, APIs, or product surfaces rather than raw ClickHouse directly. | Medium | SU001, SU016, SU018, SU020 |
| CU004 | The official adopter list and external customer lists include Cloudflare, Contentsquare, eBay, Spotify, Uber, and ByteDance, but the quality of evidence differs materially by logo. | Medium | SU002, SU028 |
| CU005 | Cloudflare has used ClickHouse in production since late 2016 and had exceeded 1,000 active replicas by 2023. | Medium | SU004 |
| CU006 | Cloudflare uses ClickHouse for HTTP analytics, DNS analytics, logging analytics, Workers runtime analysis, internal analytics, customer dashboards, Firewall Analytics, and Cloudflare Radar. | High | SU004, SU005, SU006 |
| CU007 | Cloudflare's 2025 public demo showed ClickHouse scanning 96 trillion events over one hour and 1.61 quadrillion events over one day in under two seconds. | Medium | SU003 |
| CU008 | Cloudflare's 2018 HTTP analytics pipeline used ClickHouse to support analytics on traffic running at roughly 6 million requests per second. | Medium | SU005 |
| CU009 | Cloudflare's 2022 log analytics migration to ClickHouse stored 100% of events and reduced inserter CPU and memory consumption by eight times compared with the prior Elasticsearch-centered setup. | Medium | SU006 |
| CU010 | Cloudflare's 2026 Ready-Analytics platform uses ClickHouse for millions of calls per day in usage billing, powers hundreds of millions of dollars in usage revenue and fraud workflows, and had grown to more than 2 PiB with millions of rows per second of ingest. | Medium | SU007 |
| CU011 | Contentsquare migrated its main SaaS analytics product from Elasticsearch to ClickHouse and reported 11x lower infrastructure cost, 10x p99 query improvement, and expansion of retention from one month-equivalent historical access to 13 months. | High | SU010, SU009 |
| CU012 | Contentsquare built AST-based query optimization on top of ClickHouse, and an external architecture roundup describes this abstraction layer as driving 10x speedups on the slowest 5% of queries. | High | SU010, SU009 |
| CU013 | OpenAI ingests petabytes of log data per day, says volume is growing by more than 20% per month, and runs its ClickHouse observability system with 90 shards and two replicas before adding a third query replica. | Medium | SU011 |
| CU014 | After GPT-4o image generation launched in March 2025, OpenAI experienced a 50% overnight log-volume spike and then achieved a 40% CPU reduction from a one-line Bloom-filter optimization in ClickHouse. | Medium | SU011 |
| CU015 | The accessible public proof for Uber is mainly secondary architecture coverage and older adopter-list references rather than a fresh customer-authored 2026 engineering case study. | Medium | SU009, SU002, SU029 |
| CU016 | The accessible public proof for Spotify is mainly an adopter-list entry tied to older slides rather than a fresh detailed deployment write-up. | Medium | SU002, SU030 |
| CU017 | LeadCognition lists ByteDance alongside Cloudflare, Uber, eBay, and Spotify as a production ClickHouse user, but the fetched proof quality is lower than current flagship case studies. | Low | SU028 |
| CU018 | Anthropic says ClickHouse played an instrumental role in shipping Claude 4 and now operates a custom air-gapped version of the ClickHouse Cloud architecture inside its secure compute environment. | Medium | SU012 |
| CU019 | Tesla's Comet platform on ClickHouse currently ingests tens of millions of rows per second and survived a one-billion-rows-per-second load test running for 11 days. | Medium | SU013 |
| CU020 | The same Tesla load test crossed one quadrillion rows without reported instability, making it one of the strongest public proofs of ClickHouse's telemetry-scale ceiling. | Medium | SU013 |
| CU021 | Microsoft Clarity launched publicly with ClickHouse at its core and says the system now supports millions of projects, hundreds of trillions of events, and hundreds of petabytes of data. | High | SU014, SU001 |
| CU022 | Microsoft Clarity says heat map generation moved from a roughly 30-minute offline workflow to an instantaneous task after choosing ClickHouse. | Medium | SU014 |
| CU023 | Replo's official customer story describes an analytics system trusted by more than 4,000 Shopify merchants that processes more than 100 billion events and ingests 3,000 to 5,000 events per second. | Medium | SU015 |
| CU024 | Mintlify says ClickHouse reduced dashboard latency from tens of seconds to sub-one-second, removed weekly analytics bug reports, improved estimated NPS by roughly 30%, and cut cost by around 60% versus PostHog. | Medium | SU016 |
| CU025 | Padlet reports around 40 million monthly unique users, usage in 242 of 246 countries, roughly 8 billion events in a month, 45 millisecond median query latency, and 690 millisecond p99 latency on its ClickHouse analytics layer. | Medium | SU017 |
| CU026 | Buildkite Test Engine grew from roughly 3 billion to 12 billion test executions per month in six months, now stores about 70 billion records in ClickHouse, peaks above 25,000 events per second, and saves eight dollars elsewhere for each dollar spent on ClickHouse. | Medium | SU018 |
| CU027 | Ramp says it serves more than 50,000 customers and found that 16,000 randomized queries across 60 million transactions completed in about 12 seconds on ClickHouse, while enterprise reports that timed out after 40 seconds on Postgres returned in milliseconds after migration. | Medium | SU019 |
| CU028 | Qonto serves more than 600,000 small businesses and freelancers across eight countries and says ClickHouse stores 231 TB of uncompressed span attributes in 376 GB, implying a 99.84% compression ratio and about $70,000 of annual storage savings. | Medium | SU020 |
| CU029 | Langfuse says its newer immutable Events table on ClickHouse uses roughly three times less memory and produces up to 20 times faster queries than its older mutable model. | Medium | SU021 |
| CU030 | Lyft's ClickHouse Cloud deployment reads more than 450 TB per day, writes about 4 TB per day, and handles hundreds of queries per second on average with peaks in the thousands. | Medium | SU022 |
| CU031 | Polymarket says it implemented a ClickHouse data warehouse in weeks and now serves its leaderboard API at 100s of requests per second with average latency around 25 milliseconds. | Medium | SU023 |
| CU032 | ClickHouse Cloud markets separate storage and compute, pay-for-use pricing, major cloud-marketplace availability, and reduced shard and replica management overhead. | High | SU024, SU025 |
| CU033 | Open-source availability and SQL compatibility are repeatedly cited in public materials and reviews as reducing vendor lock-in and lowering the friction of evaluation or migration. | Medium | SU011, SU027 |
| CU034 | TrustRadius reviewers praise ClickHouse's MergeTree performance and real-time warehouse utility but flag limitations in SQL-console features, cloud role granularity, and SSO support. | Medium | SU026 |
| CU035 | PeerSpot gives ClickHouse an average rating of 8.6 out of 10 and highlights speed, scalability, compression, and lack of vendor lock-in as strengths. | Medium | SU027 |
| CU036 | PeerSpot review summaries also surface recurring complaints around documentation, UI and security/admin maturity, setup complexity, and cloud pricing visibility. | Medium | SU027 |
| CU037 | Public proof shows broad logo diversity across AI, fintech, developer tools, education, mobility, and industrial analytics, which reduces obvious single-vertical concentration risk. | Medium | SU001, SU032 |
| CU038 | ClickHouse does not publicly disclose customer NRR, GRR, logo churn, renewal rates, or top-customer ARR share in the fetched materials, leaving durability and concentration as unresolved diligence items. | Medium | SU001, SU026, SU027, SU024 |
| CU039 | Public customer stories repeatedly show a land-and-expand motion in which ClickHouse lands as the fix for one urgent analytics or observability bottleneck and then expands into adjacent product or operational use cases. | Medium | SU010, SU016, SU018, SU019, SU020, SU023 |
| CU040 | Evidence quality splits into high-confidence customer-quoted production case studies, medium-confidence secondary architecture summaries, and low-confidence adopter-list or logo-directory proof, so marquee-logo claims should be weighted accordingly. | High | SU003, SU009, SU002, SU028 |
| CR001 | In May 2025 ClickHouse announced a $350 million Series C round at a $6.35 billion valuation. | Medium | SR001, SR002 |
| CR002 | The same financing package brought ClickHouse total funding to over $650 million and included a $100 million credit facility led by Stifel and Goldman Sachs. | High | SR001, SR002, SR003 |
| CR003 | ClickHouse said it grew over 300% during the prior year and now serves more than 2,000 customers. | Medium | SR001, SR002 |
| CR004 | The May 2025 financing announcement named Anthropic, Tesla, Mercado Libre, Sony, Meta, Memorial Sloan Kettering, Lyft, and Instacart as customers or recent wins. | Medium | SR001, SR002 |
| CR005 | ClickHouse Cloud is marketed as a fully managed serverless service with pay-for-use compute and autoscaling. | High | SR004, SR005 |
| CR006 | ClickHouse Cloud publicly offers a 30 day trial with $300 credits, reinforcing a self-serve developer acquisition motion. | High | SR004, SR013, SR014 |
| CR007 | The pricing page says ClickHouse separately meters storage and compute, scales unused resources toward zero, and lets customers set autoscaling limits to contain bill shock. | Medium | SR005 |
| CR008 | ClickHouse documents SLAs only for select committed-spend contracts rather than for every cloud user. | Medium | SR006 |
| CR009 | The public status page reported 98.62% aggregate uptime for February through May 2026 while AWS components showed 100% uptime. | Medium | SR007 |
| CR010 | ClickHouse Cloud says it has maintained SOC 2 Type II since 2022 and ISO 27001 since 2023. | Medium | SR008 |
| CR011 | ClickHouse says it self-certified to the U.S. Data Privacy Framework in 2024 and maintains internal GDPR and CCPA compliance programs. | High | SR008, SR009 |
| CR012 | HIPAA support is available only on the Enterprise plan and PCI service-provider compliance was added in 2025. | Medium | SR008 |
| CR013 | Privacy and compliance disclosures imply that ClickHouse must continuously manage cross-border data-transfer and privacy obligations for cloud customers. | Medium | SR008, SR009 |
| CR014 | ClickHouse documented CVE-2025-1385 as a route to arbitrary code execution on misconfigured self-managed servers that enable library bridge plus file upload paths. | High | SR019, SR020, SR022 |
| CR015 | Both ClickHouse’s security changelog and its GitHub advisory say ClickHouse Cloud was not vulnerable to CVE-2025-1385. | Medium | SR019, SR020 |
| CR016 | A GitHub advisory describes a query-cache bug in which switching roles under a single user can bypass role-based access controls and expose unauthorized rows. | Medium | SR021, SR019 |
| CR017 | ClickHouse offers cloud, server, local CLI, clickhouse-local, and embedded chDB deployment modes on the same core engine. | Medium | SR012 |
| CR018 | ClickHouse’s product and community pages advertise roughly 2.9k contributors, 29k pull requests, and 47.6k GitHub stars. | High | SR011, SR015 |
| CR019 | The community page says ClickHouse has more than 12k product makers on Slack, underscoring a developer-led funnel rather than a pure top-down enterprise motion. | Medium | SR015 |
| CR020 | Altinity argues that important capabilities such as SharedMergeTree, lightweight updates, and S3 role-based access are now cloud-only, creating open-core and fork risk for the ecosystem. | Medium | SR023 |
| CR021 | The same Altinity analysis says community trust is strained by an unclear OSS roadmap and by the ClickHouse team acting as a pull-request bottleneck. | Medium | SR023 |
| CR022 | ClickHouse Cloud markets ClickPipes and several managed integrations as cloud-only conveniences that do not exist in the same form for self-managed users. | Medium | SR004 |
| CR023 | In DB-Engines May 2026 rankings, Snowflake was #6 and Databricks #7 while ClickHouse ranked #26. | Medium | SR026 |
| CR024 | The same ranking placed DuckDB at #44 and StarRocks at #142, showing that both alternatives continue to accumulate mindshare even from much smaller bases. | Medium | SR026 |
| CR025 | Exasol’s February 2026 benchmark found ClickHouse’s successful query runtime degraded 1.39x between 1 and 16 concurrent streams. | Medium | SR024 |
| CR026 | The same benchmark found ClickHouse query success fell from 82% at one node to 36% at eight nodes on TPC-H style distributed tests because large joins hit per-shard memory limits. | Medium | SR024 |
| CR027 | Exasol’s benchmark says DuckDB still degrades 41% at 16 concurrent streams but remains attractive where single-process simplicity matters more than cluster concurrency. | Medium | SR024 |
| CR028 | A StarRocks-sponsored benchmark article claimed ClickHouse could not complete its TPC-H test set and that StarRocks was 2.2x faster on wide-table SSB queries. | Low | SR025 |
| CR029 | ClickHouse’s own product page explicitly pitches migrations from Snowflake, Redshift, Elastic, and Druid, confirming that incumbent warehouse and analytics budgets are the primary target market. | Medium | SR011 |
| CR030 | ClickHouse Cloud says it is available on all three major cloud marketplaces and handles updates, backups, scaling, and security patches automatically. | Medium | SR004 |
| CR031 | ClickHouse claims its lower cloud cost comes from compute-storage separation, autoscaling, object-backed parallel replicas, and lower replica overhead. | Medium | SR004, SR005 |
| CR032 | ClickHouse argues that many teams move from open source to ClickHouse Cloud because they do not want to manage shards, replicas, upgrades, or security patches themselves. | Medium | SR004, SR012 |
| CR033 | Use-case pages position high concurrency, millions of rows per second, and interactive dashboards as core reasons to adopt ClickHouse Cloud. | Medium | SR013, SR014 |
| CR034 | The public financing disclosures emphasized customers and growth but did not publish ARR, revenue, gross margin, or profitability metrics. | High | SR001, SR002, SR003 |
| CR035 | Because the May 2025 $6.35 billion mark came without public unit-economics disclosure, valuation underwriting depends on future cloud monetization and margin capture that outsiders cannot yet verify. | Medium | SR001, SR004, SR005 |
| CR036 | The combination of free trial credits, local deployment options, and an open-source core creates a product-led funnel that can expand adoption faster than it converts to durable paid cloud spend. | Medium | SR004, SR012, SR013, SR015 |
| CR037 | Public materials disclose more than 2,000 customers and committed-spend SLAs but no top-customer share, cohort retention, or NRR, leaving concentration risk materially under-documented. | Medium | SR001, SR006 |
| CR038 | Enterprise-only HIPAA and PCI features imply that regulated high-spend customers likely matter disproportionately to cloud monetization. | Medium | SR004, SR008 |
| CR039 | Offering the same engine across OSS, local, embedded, and fully managed modes reduces raw lock-in for buyers but increases cannibalization and upgrade-path risk for the paid cloud business. | Medium | SR012, SR017, SR023 |
| CR040 | DB-Engines describes ClickHouse as both an Apache 2.0 open-source product and a cloud offering with row policies, quotas, resource limits, and multiple wire protocols. | Medium | SR027 |
| CR041 | Public uptime documentation and aggregate status metrics show that ClickHouse Cloud reliability remains a live execution variable rather than a solved background condition. | Medium | SR006, SR007 |
| CR042 | ClickHouse’s security changelog records repeated crash, ACL, and RCE-class issues across recent OSS releases, making patch discipline part of the enterprise trust story. | High | SR019, SR022 |
| CR043 | The CVE-2025-1385 advisory explicitly tells maintainers of forked ClickHouse versions to port the fix themselves, raising maintenance burden for any serious fork. | Medium | SR020 |
| CR044 | TechCrunch framed ClickHouse as a direct Snowflake and Databricks challenger, reinforcing that investor expectations are tied to taking share from much larger data-platform incumbents. | Medium | SR029 |
| CR045 | The Tinybird alternatives page and ClickHouse’s own deployment docs show developers have multiple ways to use ClickHouse-compatible or adjacent analytics stacks without defaulting to ClickHouse Cloud. | Low | SR012, SR030 |
| CR046 | ClickHouse’s real-time analytics, warehousing, and adopters surfaces show broad workload coverage that expands addressable market but also broadens the product and support execution surface. | Medium | SR013, SR014, SR016 |
| CR047 | The combination of strong community breadth and visible roadmap tension means governance missteps would be amplified across a large developer base rather than staying a niche issue. | Medium | SR015, SR023 |
| CR048 | The cleanest public diligence asks are revenue quality by cohort, top-customer exposure, cloud gross margin, incident history by service tier, and an explicit OSS-versus-cloud roadmap split. | Medium | SR001, SR006, SR007, SR023 |
| CV001 | ClickHouse raised $350 million in a Series C round on May 29, 2025 led by Khosla Ventures. | High | SV001, SV013 |
| CV002 | The Series C included BOND, IVP, Battery Ventures, Bessemer Venture Partners, and existing investors Index Ventures, Lightspeed, GIC, Benchmark, Coatue, FirstMark, and Nebius, taking total funding to more than $650 million. | High | SV001, SV014 |
| CV003 | ClickHouse also secured a $100 million credit facility led by Stifel and Goldman Sachs alongside the Series C financing. | Medium | SV001, SV003 |
| CV004 | At the time of the round, ClickHouse said it had grown more than 300% over the prior year and served more than 2,000 customers. | High | SV001, SV004 |
| CV005 | ClickHouse describes itself as an open-source columnar database management system built for real-time analytics and large-scale analytical workloads. | High | SV001, SV008 |
| CV006 | ClickHouse Cloud monetizes through usage-based pricing with separate compute and storage charges rather than fixed-seat software pricing. | High | SV006, SV007 |
| CV007 | ClickHouse Cloud emphasizes pay-for-use compute, separate storage, and managed autoscaling as core commercial mechanics. | High | SV006, SV007 |
| CV008 | ClickHouse says its cloud architecture separates storage and compute and can offer faster warm-up and better economics than Snowflake for real-time analytics use cases. | Medium | SV007, SV009 |
| CV009 | Sacra estimated that ClickHouse reached about $160 million in annualized revenue in 2025. | Medium | SV011, SV012 |
| CV010 | Sacra reported that ClickHouse Cloud ARR was growing more than 250% year over year as of January 2026. | Medium | SV011, SV012 |
| CV011 | Sacra said ClickHouse had roughly 46,000 GitHub stars and broad open-source adoption across user-facing analytics products by February 2026. | Medium | SV012 |
| CV012 | Independent coverage pegged ClickHouse’s May 2025 Series C at approximately $6.35 billion post-money. | Medium | SV003 |
| CV013 | A $6.35 billion valuation on $160 million ARR implies about 39.7x trailing ARR. | Medium | SV003, SV012 |
| CV014 | Using a $150 million to $185 million ARR underwriting range, the Series C valuation implies roughly 34.3x to 42.3x ARR. | Medium | SV003, SV012 |
| CV015 | Snowflake generated $4.68 billion of revenue in fiscal year 2026. | High | SV021, SV022 |
| CV016 | Snowflake’s market capitalization was about $61.55 billion in late May 2026. | Medium | SV019, SV020 |
| CV017 | Snowflake traded at roughly 13.1x revenue in May 2026 based on a $61.55 billion market cap and $4.68 billion FY2026 revenue. | Medium | SV019, SV020 |
| CV018 | Databricks announced a $5.4 billion revenue run-rate and an approximately $134 billion valuation in February 2026. | High | SV015, SV016 |
| CV019 | Databricks’ February 2026 financing implied an enterprise value to revenue multiple of about 24.8x. | Medium | SV015, SV016 |
| CV020 | Databricks disclosed more than 65% year-over-year growth, net retention above 140%, and more than 20,000 organizations on the platform. | Medium | SV015, SV018 |
| CV021 | SingleStore reported ARR above $123 million in Q2 fiscal 2026, up 23% year over year. | High | SV026, SV028 |
| CV022 | SingleStore ended Q2 fiscal 2026 with more than $150 million in cash, zero debt, and free cash flow nearly breakeven over the prior twelve months. | High | SV026, SV028 |
| CV023 | Tracxn lists SingleStore’s last known valuation at $1 billion as of October 3, 2022. | Medium | SV027 |
| CV024 | A $1 billion valuation on $123 million ARR implies an approximate 8.1x ARR multiple for SingleStore. | Medium | SV026, SV027 |
| CV025 | MongoDB generated $2.46 billion of revenue in fiscal year 2026. | Medium | SV023 |
| CV026 | MongoDB’s market capitalization was about $24.74 billion in late May 2026. | Medium | SV023, SV024 |
| CV027 | MongoDB traded at roughly 10.0x revenue in May 2026. | Medium | SV023, SV024 |
| CV028 | DB-Engines ranked MongoDB fifth, Snowflake sixth, and ClickHouse twenty-sixth in its May 2026 popularity table, highlighting ClickHouse’s smaller installed-base footprint. | Medium | SV025 |
| CV029 | ClickHouse’s implied Series C multiple sits above Snowflake’s ~13x, MongoDB’s ~10x, and SingleStore’s ~8x, and even above Databricks’ ~25x despite materially smaller scale. | Medium | SV003, SV015, SV019, SV023, SV026, SV027 |
| CV030 | The gap between ClickHouse’s ~40x trailing ARR and Databricks’ ~25x suggests investors were underwriting extraordinary forward growth rather than current scale parity. | Medium | SV012, SV015, SV016 |
| CV031 | At Snowflake’s ~13.1x revenue multiple, ClickHouse would need roughly $485 million of ARR or revenue to justify a $6.35 billion valuation. | Medium | SV003, SV019, SV020 |
| CV032 | At Databricks’ ~24.8x multiple, ClickHouse would need roughly $256 million of ARR or revenue to justify $6.35 billion. | Medium | SV003, SV015, SV016 |
| CV033 | At MongoDB’s ~10.0x multiple, ClickHouse would need roughly $632 million of ARR or revenue to justify $6.35 billion. | Medium | SV003, SV023, SV024 |
| CV034 | Open-source distribution gives ClickHouse a premium narrative because free adoption can feed ClickHouse Cloud, but public open-source comp MongoDB shows the premium normalizes near ~10x once scale matures. | Medium | SV011, SV012, SV023, SV024 |
| CV035 | ClickHouse’s AI and observability customer mix shows real demand from sophisticated users, but those logos do not yet prove public-company durability on retention, margins, or workload concentration. | Medium | SV001, SV011 |
| CV036 | Reuters noted that software stocks were under pressure in 2026 over fears that fast-moving AI could disrupt software economics, reinforcing public-market multiple compression risk. | Medium | SV016 |
| CV037 | Because ClickHouse’s public revenue evidence is third-party-estimated rather than audited, the Series C price has less objective support than Databricks’ official run-rate disclosure or Snowflake’s filing-backed revenue base. | Medium | SV012, SV015, SV022 |
| CV038 | A bullish underwriting case requires ClickHouse to keep converting open-source adoption into enterprise cloud ARR and to cross roughly $300 million ARR quickly so the current mark moves from ~40x trailing to ~20x forward. | Medium | SV012, SV015, SV019 |
| CV039 | A bearish underwriting case assumes growth slows toward public and open-source comps, which could compress valuation toward a low-teens revenue multiple and materially below the Series C mark. | Medium | SV016, SV019, SV023 |
| CV040 | SingleStore’s near-breakeven profile at a much lower implied multiple shows investors pay very different prices for real-time database vendors once hypergrowth cools. | Medium | SV026, SV027, SV028 |
| CV041 | Databricks’ richer valuation is tied not only to growth but also to broader platform breadth, more than 20,000 organizations, and a disclosed retention profile, all of which ClickHouse has not matched publicly. | Medium | SV015, SV018 |
| CV042 | ClickHouse’s company materials argue that real-time analytics and lower cost per query versus legacy warehouses are central to the product moat. | Medium | SV009, SV010 |
| CV043 | The available evidence supports a track recommendation: product-market pull and open-source distribution are strong, but the May 2025 price already capitalized much of the next leg of execution. | Medium | SV003, SV012, SV015, SV019 |
| CV044 | A move from track to buy would require audited financials or management disclosure on ARR quality, retention, gross margin, and the economics of converting large open-source users to paid cloud. | Medium | SV012, SV022 |
| CV045 | Thesis-break triggers are a sharp deceleration below the growth needed for ~$300 million ARR, failure to close enterprise feature gaps, or public comps de-rating further from the current 10x to 25x range. | Medium | SV016, SV019, SV023 |
| ID | Publisher | Title | Quote |
|---|---|---|---|
| SO001 | ClickHouse | Our Story - ClickHouse | Work on ClickHouse began in 2009 ... 2016 ClickHouse releases as an open-source project under the Apache 2 license ... 2021 ClickHouse, Inc. incorporates in Delaware, with our headquarters in the San Francisco Bay Area. |
| SO002 | ClickHouse | We Stand With Ukraine | ClickHouse, Inc. is a Delaware company with headquarters in the San Francisco Bay Area. We have no operations in Russia, no Russian investors, and no Russian members of our Board of Directors. |
| SO003 | GitHub | GitHub - ClickHouse/ClickHouse: ClickHouse® is a real-time analytics database management system | |
| SO004 | Business Wire | ClickHouse Raises $250M Series B To Scale Groundbreaking OLAP Database Management System Globally | ClickHouse ... has raised $250 million in Series B funding at a $2 billion valuation. The investment was led by Coatue and Altimeter, with participation from Index Ventures, Benchmark, Lightspeed, Redpoint, Almaz, Yandex N.V., FirstMark and Lead Edge. |
| SO005 | Business Wire | ClickHouse Raises $350 Million Series C to Power Analytics for the AI Era | ClickHouse ... has raised $350 million in Series C financing. The round was led by Khosla Ventures ... Today's round follows earlier investments of over $300 million, bringing total funding to over $650 million. |
| SO006 | FirstMark | ClickHouse Raises $350 Million Series C to Power Analytics for the AI Era | |
| SO007 | Coatue | Our Partnership with ClickHouse: Powering Analytics for the AI Era | |
| SO008 | Goodwin | Goodwin Advises ClickHouse on $350 Million Series C Financing and Extension to Accelerate Growth in Real-Time Analytics for the AI Era | |
| SO009 | Silicon Valley Daily | ClickHouse Clicks With $350 Million Series C Round | |
| SO010 | Business Wire via Ritzau | ClickHouse Extends Series C Financing and Expands Leadership Team to Fuel Growth | Business Wire | The company bolstered its leadership team with three key hires. In July, Kevin Egan joined as Chief Revenue Officer ... In August, Mariah Nagy came on board as Vice President of People ... Jimmy Sexton joined as Chief Financial Officer. |
| SO011 | Sacra | ClickHouse at $160M ARR | Sacra estimates ClickHouse hit $160M ARR by the end of 2025, growing 256% year-over-year. |
| SO012 | Forbes | ClickHouse | Company Overview & News | ClickHouse creator Alexey Milovidov cofounded the Portola Valley, California-based company in 2021 with ex-Salesforce exec Aaron Katz and ex-Netflix exec Yury Izrailevsky. |
| SO013 | Craft | ClickHouse CEO and Key Executive Team | Craft.co | |
| SO014 | Unify | Employee Data and Trends for ClickHouse | Unify | |
| SO015 | JFrog | 7 RCE and DoS vulnerabilities Found in ClickHouse DBMS | The JFrog Security research team ... discovered seven new security vulnerabilities in ClickHouse DBMS. |
| SO016 | Ubuntu | USN-6933-1: ClickHouse vulnerabilities | Ubuntu security notices | Ubuntu | It was discovered that ClickHouse incorrectly handled memory, leading to a heap-based buffer overflow ... or execute arbitrary code. (CVE-2021-43305) |
| SO017 | DEV Community | Lessons Learned #2: Your new feature could introduce a security vulnerability to your old feature (Clickhouse CVE-2024-22412) | |
| SO018 | Tracxn | ClickHouse - Funding Rounds & List of Investors | |
| SO019 | PitchBook News | Big Data wars: ClickHouse has a playbook to beat Snowflake at its own game | This past May, it raised a mammoth $350 million round led by Khosla Ventures ... and the company just hit $100 million in annualized revenue. |
| SO020 | Lightspeed Venture Partners | ClickHouse | |
| SO021 | Index Ventures | The Fast and the Furious: How ClickHouse, the world's fastest open-source database, is creating the first real-time data warehouse | In August of 2021, they announced the incorporation of ClickHouse, Inc., along with $50 million in Series A funding led by Index Ventures and Benchmark. |
| SO022 | Colorado Department of State via OpenGovCO | ClickHouse, Inc. · 4113 Alpine Rd, Portola Valley, CA 94028 | The entity was formed on August 25, 2021 in the jurisdiction of Delaware. The registered office location is at 4113 Alpine Rd, Portola Valley, CA 94028. |
| SO023 | Craft | ClickHouse Corporate Headquarters, Office Locations and Addresses | Craft.co | |
| SO024 | PitchBook | ClickHouse 2026 Company Profile: Valuation, Funding & Investors | PitchBook | ClickHouse is headquartered in San Francisco, CA. ClickHouse has 531 total employees. |
| SO025 | Tracxn | ClickHouse Company Profile | ClickHouse has 569 employees as of Apr 26. |
| SO026 | AIM Media House | ClickHouse Raises $350M at $6.35B: Future of Analytics | ClickHouse has now confirmed a $350 million Series C at a valuation of $6.35 billion. |
| SO027 | Index Ventures | Index Ventures: Mike Volpi | Index Ventures | He's currently serving on the boards of Aurora, ClickHouse, Cockroach Labs, Cohere, Confluent, Covariant.ai, Kong, Scale, Sonos, and Wealthfront. |
| SO028 | Forbes | Peter Fenton | He currently serves on the boards of Airtable, ClickHouse, Cockroach Labs, Docker, CarbonDrop, Mercor, Sorare, Timescale, and Wildlife Studios. |
| SM001 | ClickHouse | Fast Open-Source OLAP DBMS - ClickHouse | ClickHouse is a fast open-source column-oriented database management system that allows generating analytical data reports in real-time using SQL queries. |
| SM002 | ClickHouse Docs | Deployment modes | ClickHouse Docs | ClickHouse Server can be installed locally, deployed to AWS GCP or Azure, or run on on-premises hardware; ClickHouse Cloud is the fully managed deployment mode. |
| SM003 | ClickHouse | ClickHouse Cloud | Cloud Based DBMS | ClickHouse | ClickHouse Cloud is the fastest, most cost-efficient way to build real-time analytics, observability, and AI-powered data applications, and is available on all three major cloud marketplaces. |
| SM004 | ClickHouse | ClickStack: High-Performance Open Source Observability | Logs, Metrics, Traces with ClickHouse | ClickStack is open source observability for OpenTelemetry at scale with sub-second queries, 10-100x cost savings, and logs metrics traces and session replays powered by ClickHouse. |
| SM005 | GitHub | GitHub - ClickHouse/ClickHouse: ClickHouse® is a real-time analytics database management system | |
| SM006 | DB-Engines | ClickHouse System Properties | ClickHouse is a high-performance column-oriented SQL DBMS for OLAP and is available as both open-source software and a cloud offering. |
| SM007 | ClickHouse | Welcome to the ClickHouse Community | The ClickHouse community page reports 12k+ Slack members, 2.9k+ contributors, 29k+ PRs, 796 releases, and 47.6k+ GitHub stars. |
| SM008 | ClickHouse Docs | ClickHouse adopters | ClickHouse Docs | A list of companies using ClickHouse and their success stories. |
| SM009 | ClickHouse | ClickHouse Pricing | ClickHouse pricing automatically scales compute up and down, scales unused resources down to zero, and separates storage and compute. |
| SM010 | ClickHouse | Real-time Analytics with ClickHouse | ClickHouse highlights continuous ingest, high query concurrency, and low-latency analytics for interactive apps and dashboards. |
| SM011 | ClickHouse | Data warehousing with ClickHouse | ClickHouse positions itself as a real-time data warehouse for BI with faster queries at a fraction of the cost. |
| SM012 | Mordor Intelligence | Cloud Data Warehouse Market Share & Size 2031 Outlook | The Cloud Data Warehouse Market worth USD 14.94 billion in 2026 is growing at a CAGR of 26.86% to reach USD 49.12 billion by 2031. |
| SM013 | Research and Markets | Cloud Data Warehouse Market Report 2026 - Research and Markets | The Cloud Data Warehouse Market, valued at USD 14.53B in 2026, is projected to reach USD 31.7B by 2030, growing at a 21.5% CAGR. |
| SM014 | MarketsandMarkets | Cloud Data Warehouse Market Share, Forecast | Growth Analysis & Opportunities | The global market for cloud data warehouse is categorized by application, vertical, deployment model, type, and region. |
| SM015 | IndustryARC | Cloud Data Warehouse Market size, Industry outlook, Market forecast, Demand Analysis, Market Share, Market Report 2021-2026 | Cloud Data Warehouse Market is forecast to reach $39.1 billion by 2026 after growing at a CAGR of 31.4% during 2021-2026. |
| SM016 | Grand View Research | Streaming Analytics Market Size | Industry Report, 2030 | The global streaming analytics market was valued at USD 23.4 billion in 2023 and is projected to reach USD 128.4 billion by 2030 at a CAGR of 28.3%. |
| SM017 | Grand View Research | Observability Tools And Platforms Market Size Report, 2030 | The global observability tools and platforms market size was estimated at USD 2.71 billion in 2023 and is projected to reach USD 5.40 billion by 2030 at a CAGR of 10.7%. |
| SM018 | MarketsandMarkets | Observability Tools and Platforms Market Size & Trends, Growth Analysis, Industry Forecast [2030] | The global observability tools and platforms market is projected to grow from USD 2.4 billion in 2023 to USD 4.1 billion by 2028 at a CAGR of 11.7%. |
| SM019 | Mordor Intelligence | Observability Market Size, Report, Share & Competitive Landscape 2031 | The Observability Market worth USD 3.35 billion in 2026 is growing at a CAGR of 15.62% to reach USD 6.93 billion by 2031. |
| SM020 | Google Cloud | BigQuery | AI data platform | EDW | BigQuery is Google Cloud's fully managed and completely serverless enterprise data warehouse with real-time analytics, built-in AI, and decoupled storage and compute. |
| SM021 | Datadog | Infrastructure & Application Monitoring as a Service | Datadog | Datadog presents a unified observability platform that aggregates metrics events logs and traces and supports real-time interactive dashboards. |
| SM022 | Datadog | Pricing | Datadog | Datadog pricing breaks observability into ingest, indexing, storage, archiving, and AI or LLM observability products, underscoring how buyers manage telemetry cost by tier. |
| SM023 | Elastic | Full-stack observability solution — built on the Elasticsearch Platform | Elastic describes an AI-powered OpenTelemetry-first observability platform with best-in-class efficiency for logs and metrics and one platform for everything. |
| SM024 | Amazon Web Services | Open Source Search Engine - Amazon OpenSearch Service - AWS | Amazon OpenSearch Service simplifies AI-powered search, observability, and vector database operations with both managed clusters and serverless deployment. |
| SM025 | Grafana Labs | 2026 observability trends and predictions from Grafana Labs | Grafana Labs | Grafana argues that in 2026 unified observability becomes the default operating model, data value overtakes data volume, AI becomes a collaborator, and OpenTelemetry becomes the default. |
| SM026 | IBM | Observability Trends 2026 | IBM | IBM argues that 2026 observability strategies must become more intelligent, cost-effective, and compatible with open standards as AI adoption grows. |
| SM027 | Altinity | Ecosystem Projects | Altinity describes itself as the second-largest contributor to ClickHouse and highlights open-source tools including the Kubernetes operator, clickhouse-backup, and a Grafana plugin. |
| SP001 | ClickHouse | Real-Time Data Analytics Platform | ClickHouse | The fastest open-source analytical database. |
| SP002 | ClickHouse | ClickHouse Pricing | We scale storage and compute separately, due to our flexible architecture. |
| SP003 | ClickHouse | Deployment modes | ClickHouse Docs | ClickHouse Server can be installed on your local machine... deployed to major cloud providers... or set up on your own on-premises hardware. |
| SP004 | ClickHouse | ClickHouse Cloud | Cloud Based DBMS | ClickHouse | ClickHouse Cloud: the fastest, most cost-efficient way to build real-time analytics, observability, and AI-powered data applications. |
| SP005 | ClickHouse | Our Story - ClickHouse | 2021: ClickHouse, Inc. incorporates in Delaware... |
| SP006 | Business Wire | ClickHouse Raises $350 Million Series C to Power Analytics for the AI Era | ClickHouse, Inc. ... has raised $350 million in Series C financing. |
| SP007 | ClickHouse | ClickHouse benchmarks: Performance, cost & scalability compared | All proven by benchmarks that can be reproduced by anyone. |
| SP008 | DB-Engines | ClickHouse System Properties | A high-performance, column-oriented SQL DBMS for online analytical processing. |
| SP009 | Snowflake | The Snowflake Platform | Snowflake supports multi-cloud and cross-region operations. |
| SP010 | Snowflake | Snowflake - Investor Relations | 790 Forbes Global 2000 Customers ... 733 $1M+ Customers ... 125% Net Revenue Retention Rate. |
| SP011 | Snowflake | Understanding overall cost | Snowflake Documentation | The total cost of using Snowflake is the aggregate of the cost of using data transfer, storage, and compute resources. |
| SP012 | Snowflake | Overview of warehouses | Snowflake Documentation | Snowflake utilizes per-second billing (with a 60-second minimum each time the warehouse starts). |
| SP013 | U.S. Securities and Exchange Commission | EDGAR Entity Landing Page | |
| SP014 | Databricks | Data Lakehouse Architecture | Databricks | One architecture for integration, storage, processing, governance, sharing, analytics and AI. |
| SP015 | Databricks | Databricks Pricing: Flexible Plans for Data and AI Solutions | The Price List displays Databricks' undiscounted price for each SKU. |
| SP016 | Databricks | About Databricks: The data and AI company | More than 20,000 organizations worldwide ... and 70% of the Fortune 500 rely on the Databricks Data Intelligence Platform. |
| SP017 | Databricks | What Is a Lakehouse? | A lakehouse is a new, open architecture that combines the best elements of data lakes and data warehouses. |
| SP018 | Google Cloud | BigQuery | AI data platform | EDW | BigQuery is the autonomous data to AI platform. |
| SP019 | Google Cloud | BigQuery | BigQuery is a serverless data analytics platform. |
| SP020 | AWS | Cloud Data Warehouse - Amazon Redshift - AWS | Amazon Redshift is built on cloud economics that scale with your usage. |
| SP021 | AWS | Amazon Redshift Pricing | Redshift Provisioned starts at $0.543 per hour, while Redshift Serverless begins at $1.50 per hour. |
| SP022 | AWS | Interactive SQL - Amazon Athena - AWS | Get streamlined, near-instant startup of SQL or Apache Spark analytics workloads with a serverless experience. |
| SP023 | AWS | Amazon Athena Pricing | Pricing is simple: you pay based on data processed or compute used. |
| SP024 | DuckDB Foundation | Why DuckDB | DuckDB does not run as a separate process, but completely embedded within a host process. |
| SP025 | StarRocks | StarRocks | A High-Performance Analytical Database | One Engine for Real-Time, Lakehouse, and AI. |
| SP026 | Apache Druid | Apache Druid | Apache® Druid | A high performance, real-time analytics database that delivers sub-second queries on streaming and batch data at scale and under load. |
| SP027 | Imply | Imply Enterprise - Imply | Commercial distribution of Druid. |
| SP028 | Imply | Imply Database as a Service Cost | Real-Time Analytics Database-a-Service Cost | Starter ... Starts at $100/month. Standard ... Starts at $600/month. |
| SP029 | SingleStore | Product Overview | SingleStore Helios cloud service | SingleStore Helios is a cloud database-as-a-service available on leading public clouds. |
| SP030 | SingleStore | SingleStore Pricing | The cost of SingleStore is determined by actual usage. |
| SP031 | SingleStore | About SingleStore | SingleStore brings you the world’s fastest distributed SQL database for real-time applications and analytics. |
| SP032 | SingleStore | Deploy · SingleStore Self-Managed Documentation | SingleStore can be deployed on bare metal, on virtual machines, or in the cloud. |
| SP033 | Alphabet | Alphabet Investor Relations - Investors | |
| SP034 | Amazon | Annual reports, proxies and shareholder letters | |
| SI001 | ClickHouse | ClickHouse Cloud | Cloud Based DBMS | ClickHouse | |
| SI002 | ClickHouse | ClickHouse Pricing | |
| SI003 | ClickHouse Docs | Cloud changelog - 2022 | ClickHouse Docs | |
| SI004 | ClickHouse | ClickHouse Extends Series C Financing and Expands Leadership Team to Fuel Growth | |
| SI005 | Yahoo Finance | ClickHouse Raises $350 Million Series C to Power Analytics for the AI Era | |
| SI006 | FirstMark | ClickHouse Raises $350 Million Series C to Power Analytics for the AI Era | |
| SI007 | FinSMEs | ClickHouse Raises $350M in Series C Funding | |
| SI008 | TechCrunch | Snowflake, Databricks challenger ClickHouse hits $15B valuation | TechCrunch | |
| SI009 | Sacra | ClickHouse revenue, funding & news | |
| SI010 | Business Wire | ClickHouse Raises $250M Series B To Scale Groundbreaking OLAP Database Management System Globally | |
| SI011 | Business Wire | Announcing ClickHouse Cloud: Democratizing lightning-fast insights and analytics | |
| SI012 | TechCrunch | ClickHouse launches ClickHouse Cloud, extends its Series B | TechCrunch | |
| SI013 | Goodwin | Goodwin Advises ClickHouse on $350 Million Series C Financing and Extension to Accelerate Growth in Real-Time Analytics for the AI Era | News & Events | Goodwin | |
| SI014 | m3ter | The ClickHouse Story with m3ter | |
| SI015 | App Developer Magazine | ClickHouse Cloud beta released on AWS | |
| SI016 | Snowflake via SEC EDGAR | EDGAR Search Results | |
| SI017 | MongoDB via SEC EDGAR | EDGAR Search Results | |
| SI018 | Confluent via SEC EDGAR | EDGAR Search Results | |
| SI019 | Elastic via SEC EDGAR | EDGAR Search Results | |
| SI020 | Business Wire | ClickHouse Extends Series C Financing and Expands Leadership Team to Fuel Growth | |
| SI021 | ClickHouse Docs | ClickHouse Docs | ClickHouse Docs | |
| SI022 | ClickHouse Docs | Changelog 2026 | ClickHouse Docs | |
| SI023 | ClickHouse | Real-time Analytics with ClickHouse | |
| SI024 | ClickHouse | ClickStack: High-Performance Open Source Observability | Logs, Metrics, Traces with ClickHouse | |
| SI025 | ClickHouse | Data warehousing with ClickHouse | |
| SE001 | ClickHouse | ClickHouse Cloud | Cloud Based DBMS | ClickHouse | |
| SE002 | ClickHouse | Introduction | ClickHouse Docs | |
| SE003 | ClickHouse | Architecture overview | ClickHouse Docs | |
| SE004 | ClickHouse | Architecture Overview | ClickHouse Docs | |
| SE005 | ClickHouse | MergeTree table engine | ClickHouse Docs | |
| SE006 | ClickHouse | Shared | ClickHouse Docs | |
| SE007 | ClickHouse | Kafka table engine | ClickHouse Docs | |
| SE008 | ClickHouse | Integrating dbt and ClickHouse | ClickHouse Docs | |
| SE009 | ClickHouse | Integrations | ClickHouse Docs | |
| SE010 | ClickHouse | Cloud changelog - 2026 | ClickHouse Docs | |
| SE011 | ClickHouse | Monitoring Cloudflare logs with ClickStack | ClickHouse Docs | |
| SE012 | ClickHouse | ClickHouse adopters | ClickHouse Docs | |
| SE013 | ClickHouse | Trouble will find you: How Cloudflare uses ClickHouse to scale analytics at quadrillion-row scale | |
| SE014 | GitHub | GitHub - ClickHouse/ClickHouse: ClickHouse® is a real-time analytics database management system | |
| SE015 | GitHub | GitHub - ClickHouse/clickhouse-docs: Official documentation for ClickHouse | |
| SE016 | DB-Engines | ClickHouse System Properties | |
| SE017 | PyPI | clickhouse-connect · PyPI | |
| SE018 | npm | @clickhouse/client - npm | |
| SE019 | Docker | clickhouse/clickhouse-server - Docker Image | |
| SE020 | Microsoft Learn | Power Query ClickHouse connector - Power Query | |
| SE021 | Business Wire | ClickHouse Cloud Is Now Generally Available on Microsoft Azure | |
| SE022 | TrustRadius | ClickHouse Reviews & Ratings 2026 | TrustRadius | |
| SE023 | HypeQuery | Seven Companies, One Pattern: Why Every Scaled ClickHouse Deployment Looks the Same | |
| SE024 | ClickHouse | User stories - ClickHouse | |
| SE025 | ClickHouse | How ClickHouse powers Netflix, Uber and Spotify’s Analytics | Aaron Katz, CEO of ClickHouse | |
| SU001 | ClickHouse | User stories | |
| SU002 | ClickHouse | ClickHouse adopters | |
| SU003 | ClickHouse | Trouble will find you: How Cloudflare uses ClickHouse to scale analytics at quadrillion-row scale | A single query scanned 96 trillion events in an hour and returned in less than two seconds. |
| SU004 | ClickHouse | London Meetup Report: How Cloudflare processes hundreds of millions of rows per second with ClickHouse | This year we actually exceeded a thousand active replicas. That's processing hundreds of millions of inserted rows every second. |
| SU005 | Cloudflare | HTTP Analytics for 6M requests per second using ClickHouse | Cloudflare has grown tremendously... from under 1M requests per second to current levels of 6M requests per second. |
| SU006 | Cloudflare | Log analytics using ClickHouse | CPU and memory consumption on the inserter side were reduced by eight times. |
| SU007 | Cloudflare | Our billing pipeline was suddenly slow. The culprit was a hidden bottleneck in ClickHouse | This pipeline powers hundreds of millions of dollars in usage revenue, fraud systems, and more. |
| SU008 | ClickHouse | How ClickHouse powers Netflix, Uber and Spotify’s Analytics | |
| SU009 | HypeQuery | Seven Companies, One Pattern: Why Every Scaled ClickHouse Deployment Looks the Same | Uber's QueryBridge migration preserved 10,000+ Kibana dashboards with zero user retraining. |
| SU010 | ClickHouse | Contentsquare migration from Elasticsearch to ClickHouse | ClickHouse turned out to be 11 times cheaper and allowed us to have a 10x performance improvement in our p99 for queries. |
| SU011 | ClickHouse | Why OpenAI chose ClickHouse for petabyte-scale observability | Every day, the company ingests petabytes of log data... and that volume is growing by more than 20% each month. |
| SU012 | ClickHouse | How Anthropic is using ClickHouse to scale observability for the AI era | ClickHouse played an instrumental role in helping us develop and ship Claude 4. |
| SU013 | ClickHouse | How Tesla built a quadrillion-row-scale observability platform on ClickHouse | Over one quadrillion rows ingested—with not a single hiccup, not a single issue. |
| SU014 | Microsoft Clarity | Why Microsoft Clarity chose ClickHouse | Heat map generation became an instantaneous task to do, and it was even orders of magnitude cheaper to run. |
| SU015 | ClickHouse | Replo uses ClickHouse to power real-time merchant analytics | Replo, an AI-powered page builder trusted by more than 4,000 Shopify merchants... [is] capable of processing and analyzing more than 100 billion events. |
| SU016 | ClickHouse | Mintlify boosts NPS 30% and saves 60% with real-time analytics on ClickHouse Cloud | Dashboards that took tens of seconds to load in PostHog now return results in under a second. |
| SU017 | ClickHouse | How Padlet uses ClickHouse Cloud to power real-time classroom analytics | Padlet ingested roughly 8 billion events into ClickHouse in a single month. |
| SU018 | ClickHouse | How Buildkite transformed test analytics and cut costs with ClickHouse Cloud | For every dollar spent on ClickHouse, the team is saving eight dollars elsewhere. |
| SU019 | ClickHouse | Just OLAP it: How Ramp rebuilt its analytics platform on ClickHouse Cloud | When those customers tried to run reports... charts that once timed out after 40 seconds were returning in milliseconds. |
| SU020 | ClickHouse | Goodbye limitations, hello data: How Qonto is rethinking observability with ClickHouse Cloud | Qonto’s ResourceAttributes and SpanAttributes columns store 231 TB of uncompressed data in 376 GB. |
| SU021 | ClickHouse | How Langfuse is scaling LLM observability for the agentic era with ClickHouse | Compared to the old approach, the new model delivered around three times less memory usage and up to 20 times faster queries. |
| SU022 | ClickHouse | Lyft analytics on ClickHouse Cloud | Reading more than 450 terabytes of data per day and writing around 4 terabytes... |
| SU023 | ClickHouse | How Polymarket scales data with Postgres and ClickHouse | The API now handles 100s requests per second, with an average latency of approximately 25 milliseconds. |
| SU024 | ClickHouse | ClickHouse Cloud | Available on all three major cloud marketplaces. |
| SU025 | ClickHouse | Pricing philosophy | We automatically scale unused resources down to zero so that you don’t pay for idle services. |
| SU026 | TrustRadius | ClickHouse reviews | Managing roles in the Cloud could have more options... [and] it still misses SSO compatibility with some Identity Providers like Okta. |
| SU027 | PeerSpot | ClickHouse reviews | My experience with pricing, setup cost, and licensing indicates that it is very expensive—ClickHouse is the most expensive option. |
| SU028 | LeadCognition | Companies and developers using ClickHouse | ClickHouse is used in production at Cloudflare, Uber, eBay, Spotify... |
| SU029 | Uber Engineering | Uber Engineering blog homepage | |
| SU030 | Spotify Engineering | Spotify Engineering homepage | |
| SU031 | G2 | ClickHouse reviews | |
| SU032 | CaseStudies.com | ClickHouse B2B case studies and customer successes | |
| SR001 | Business Wire | ClickHouse Raises $350 Million Series C to Power Analytics for the AI Era | The company grew over 300% during the past year and now serves over 2,000 customers. |
| SR002 | Yahoo Finance | ClickHouse Raises $350 Million Series C to Power Analytics for the AI Era | |
| SR003 | Goodwin | Goodwin Advises ClickHouse on $350 Million Series C Financing and Extension to Accelerate Growth in Real-Time Analytics for the AI Era | News & Events | Goodwin | |
| SR004 | ClickHouse | ClickHouse Cloud | Cloud Based DBMS | ClickHouse | ClickHouse Cloud offers a serverless hosted DBMS solution. Automatic scaling and no infrastructure to manage at consumption-based pricing. |
| SR005 | ClickHouse | ClickHouse Pricing | |
| SR006 | ClickHouse Docs | Service uptime | ClickHouse Docs | |
| SR007 | ClickHouse Cloud Status | ClickHouse Cloud Status | |
| SR008 | ClickHouse Docs | Compliance overview | ClickHouse Docs | |
| SR009 | ClickHouse | ClickHouse Privacy Policy | |
| SR010 | ClickHouse | ClickHouse benchmarks: Performance, cost & scalability compared | |
| SR011 | ClickHouse | Real-Time Data Analytics Platform | ClickHouse | |
| SR012 | ClickHouse Docs | Deployment modes | ClickHouse Docs | |
| SR013 | ClickHouse | Real-time Analytics with ClickHouse | |
| SR014 | ClickHouse | Data warehousing with ClickHouse | |
| SR015 | ClickHouse | Welcome to the ClickHouse Community | |
| SR016 | ClickHouse Docs | ClickHouse adopters | ClickHouse Docs | |
| SR017 | GitHub | GitHub - ClickHouse/ClickHouse: ClickHouse® is a real-time analytics database management system | |
| SR018 | GitHub | Contributors to ClickHouse/ClickHouse | |
| SR019 | ClickHouse Docs | Security changelog | ClickHouse Docs | |
| SR020 | GitHub Security Advisory | CVE-2025-1385- Fail input validation in clickhouse-library-bridge API could lead to RCE | |
| SR021 | GitHub Security Advisory | Role-based Access Control is bypassed when query caching is enabled. | |
| SR022 | NIST National Vulnerability Database | NVD - CVE-2025-1385 | |
| SR023 | Altinity | Is ClickHouse® Moving Away from Open Source? | Important new features are available only in ClickHouse Cloud. |
| SR024 | Exasol | How 5 Databases Actually Scale across Concurrency, Data, and Nodes | |
| SR025 | Habr | StarRocks vs. ClickHouse, Apache Druid, and Trino | |
| SR026 | DB-Engines | DB-Engines Ranking | |
| SR027 | DB-Engines | ClickHouse System Properties | |
| SR028 | Chaos and Order | Database Engines 2026 Deep-Dive — Postgres Won the API, ClickHouse and DuckDB Won Analytics | |
| SR029 | TechCrunch | Snowflake, Databricks challenger ClickHouse hits $15B valuation | TechCrunch | |
| SR030 | Tinybird | Honest guide to the best ClickHouse® alternatives in 2026 | |
| SV001 | ClickHouse / BusinessWire | ClickHouse Raises $350 Million Series C to Power Analytics for the AI Era | |
| SV002 | Yahoo Finance | ClickHouse Raises $350 Million Series C to Power Analytics for the AI Era | |
| SV003 | AIM Research | ClickHouse Raises $350M at $6.35B: Future of Analytics | |
| SV004 | SiliconANGLE | ClickHouse reels in $350M for its high-speed columnar database | |
| SV005 | FinSMEs | ClickHouse Raises $350M in Series C Funding | |
| SV006 | ClickHouse | ClickHouse Pricing | |
| SV007 | ClickHouse | ClickHouse Cloud | Cloud Based DBMS | ClickHouse | |
| SV008 | ClickHouse Docs | What is ClickHouse? | ClickHouse Docs | |
| SV009 | ClickHouse | Snowflake vs ClickHouse | |
| SV010 | ClickHouse | Real-time Analytics with ClickHouse | |
| SV011 | Sacra | ClickHouse revenue, funding & news | |
| SV012 | Sacra | ClickHouse at $160M ARR | |
| SV013 | FirstMark | ClickHouse Raises $350 Million Series C to Power Analytics for the AI Era | |
| SV014 | MarketScreener | ClickHouse, Inc. announced that it has received $350 million in funding from a group of investors | |
| SV015 | Databricks | Databricks Grows >65% YoY, Surpasses $5.4 Billion Revenue Run-Rate, Doubles Down on Lakebase and Genie | |
| SV016 | Reuters / U.S. News | Databricks Raises $5 Billion in Latest Funding, Defying Software Selloff | |
| SV017 | CNBC | Databricks raises capital at $134 billion valuation in latest funding round | |
| SV018 | Sacra | Databricks revenue, valuation & funding | |
| SV019 | Stock Analysis | Snowflake (SNOW) Revenue 2019-2026 | |
| SV020 | CompaniesMarketCap | Snowflake (SNOW) - Market capitalization | |
| SV021 | Nasdaq | Snowflake Reports Financial Results for the Fourth Quarter and Full-Year of Fiscal 2026 | |
| SV022 | Securities and Exchange Commission | Snowflake 10-K for fiscal year ended January 31, 2026 | |
| SV023 | Stock Analysis | MongoDB (MDB) Revenue 2016-2026 | |
| SV024 | CompaniesMarketCap | MongoDB (MDB) - Market capitalization | |
| SV025 | DB-Engines | DB-Engines Ranking | |
| SV026 | SingleStore | SingleStore Delivers Record Performance in the Second Quarter of Fiscal Year 2026 | |
| SV027 | Tracxn | SingleStore | |
| SV028 | StorageNewsletter | SingleStore Delivers Record Performance in the Second Quarter of Fiscal Year 2026 | |
| SV029 | Owler | SingleStore’s Competitors, Revenue, Number of Employees, Funding, Acquisitions & News - Owler Company Profile | |
| SV030 | ClickHouse | Learn about the latest ClickHouse tips, tricks and company announcements. |