SPLY Capital

03/26/2026 | Press release | Archived content

The Missing Layer

80-90% of enterprise data is unstructured, scattered across incompatible storage environments, and physically pinned wherever it was first created. The models are ready. The compute is available. The data is stuck.

First, It Is Essential to Understand the Physics That Created the Opening

Data Gravity Is Real - and AI Made It a Hard Wall

Data gravity is the infrastructure world's version of a black hole: the more data accumulates somewhere, the harder it becomes to move. Applications attach. Workflows calcify. Latency assumptions get baked in. After years, the data is immovable owing to its accumulation. Enterprise data pools across on-premise NAS arrays, multiple cloud regions, cold archives, and departmental tools that cannot see each other. At current AWS egress rates of $0.09/GB, transferring a single petabyte costs roughly $90,000 in fees alone, before downtime or re-validation. Gartner notes that 63% of organizations lack the data practices to even make their data AI-ready.

For traditional IT workloads, this was a nuisance. For AI, it's a hard wall. Fine-tuning on proprietary data is now the primary enterprise AI differentiator, and it requires accessing distributed internal data at scale. The workaround is weeks of manual staging per training run. Inference is worse: RAG pipelines need context in milliseconds; cold archives three network hops away cause production failures. This is the gap between a compelling demo and a working deployment. Nearly every enterprise ML team has hit it.

H100s now rent from $2-4/hour on major cloud platforms. Even so, Run:ai found ~40% of enterprise GPU time is lost to I/O wait. On a 100-GPU cluster at $3/hr average, that's roughly $260,000 wasted per month - from data that can't reach compute fast enough.

Metric Figure Source
Enterprise data that is unstructured 80-90% Gartner, 2024; IDC StorageSphere, 2024
Orgs lacking AI-ready data practices 63% Gartner Newsroom, Feb 2025
GPU time lost to I/O wait, enterprise clusters ~40% Run:ai GPU Utilization Report, 2025
AWS egress cost per GB (outbound internet) $0.09/GB AWS Pricing, verified Q1 2025
Cost to transfer 1 PB (egress only) ~$90K Derived from AWS list pricing
Unstructured data growth, year-over-year 55-65% IDC Global DataSphere Forecast, 2025

Why Nobody Has Built This

Hyperscalers can't. Data gravity is their moat. An AWS product that makes it trivially cheap to move data to Azure is corporate self-harm. Multicloud and hybrid orchestration will never come from a cloud provider; the market structurally requires a neutral party.

Legacy storage vendors won't. NetApp and Dell EMC manage individual clusters well. Their architecture assumes data lives somewhere specific and compute comes to it. A decade of managing installed base has left them unable to pivot fast enough.

Startups are solving the wrong problem. Most are either racing S3 to zero on object storage pricing, or building data catalogs - metadata tools that tell you where data lives without fixing the latency of getting it. Knowing where the food is isn't the same as being able to eat it.

The Hammerspace Insight: Separate the Namespace from the Data

In conventional storage, metadata and data are bound together. The record of where a file lives is stored with the file itself. Any global operation requires touching the actual data. At petabyte scale across a dozen environments, this is why everything is slow.

Hammerspace maintains a globally distributed metadata plane independent of where data physically sits. The result: a single coherent namespace spanning every storage environment - on-premise, cloud, archive - without moving anything. From any compute environment, data appears local.

The right analogy is BGP: the routing protocol that makes the internet work across thousands of independent networks without owning any of them. BGP didn't look like a business until everything ran on top of it. Hammerspace is building the equivalent coordination layer for enterprise data: neutral, invisible, and load-bearing for every AI workload above it.

The execution moat is real. Building a metadata system that stays consistent across heterogeneous environments, at millisecond latency, while handling fifteen years of enterprise infrastructure debt requires implementation-level knowledge of distributed file systems. The founding team built the original NFS protocol.

The Investment Lens

The namespace gets more valuable with every environment registered. Unlike most infrastructure software, this is a genuine network effect: the map becomes more complete, switching costs compound, and the competitive gap widens with every customer added. Hyperscalers are structurally prevented from replicating it. Open source can copy the concept but not the years of certified interoperability with dozens of storage environments.

If the global namespace becomes infrastructure, the position that follows is extraordinary: every AI workload touching unstructured enterprise data runs through this layer. Pricing power, expansion into policy enforcement and governance, and a moat that deepens with scale. TCP/IP didn't look like a business until everything depended on it.

What to Watch

  • Land-and-expand velocity - one environment per account, prove latency, expand to the full estate
  • Revenue from AI-specific workloads distinct from general data management contracts
  • Hyperscaler channel agreements - Hammerspace's neutrality makes this plausible in a way it wouldn't be for a cloud-native competitor
  • Gross margins trending toward software - the margin profile will tell you whether the product is working or whether it's being sold with implementation services

Everyone is watching the model race. The more interesting company is building the road the models have to run on.

Sources

  • Gartner - Unstructured Data & GenAI (2024)
  • Gartner - AI-Ready Data Newsroom (Feb 2025)
  • IDC StorageSphere Structured/Unstructured Forecast (2024-2028)
  • IDC Global DataSphere Forecast (2025-2029)
  • Run:ai - GPU Utilization eBook (2025)
  • AWS - Data Transfer Costs Overview
  • AWS - EC2 On-Demand Pricing (H100 / p5 instances)

Published by SPLY Capital · March 2026

SPLY Capital published this content on March 26, 2026, and is solely responsible for the information contained herein. Distributed via Public Technologies (PUBT), unedited and unaltered, on June 16, 2026 at 09:31 UTC. If you believe the information included in the content is inaccurate or outdated and requires editing or removal, please contact us at [email protected]