03/26/2026 | Press release | Archived content
80-90% of enterprise data is unstructured, scattered across incompatible storage environments, and physically pinned wherever it was first created. The models are ready. The compute is available. The data is stuck.
Data gravity is the infrastructure world's version of a black hole: the more data accumulates somewhere, the harder it becomes to move. Applications attach. Workflows calcify. Latency assumptions get baked in. After years, the data is immovable owing to its accumulation. Enterprise data pools across on-premise NAS arrays, multiple cloud regions, cold archives, and departmental tools that cannot see each other. At current AWS egress rates of $0.09/GB, transferring a single petabyte costs roughly $90,000 in fees alone, before downtime or re-validation. Gartner notes that 63% of organizations lack the data practices to even make their data AI-ready.
For traditional IT workloads, this was a nuisance. For AI, it's a hard wall. Fine-tuning on proprietary data is now the primary enterprise AI differentiator, and it requires accessing distributed internal data at scale. The workaround is weeks of manual staging per training run. Inference is worse: RAG pipelines need context in milliseconds; cold archives three network hops away cause production failures. This is the gap between a compelling demo and a working deployment. Nearly every enterprise ML team has hit it.
H100s now rent from $2-4/hour on major cloud platforms. Even so, Run:ai found ~40% of enterprise GPU time is lost to I/O wait. On a 100-GPU cluster at $3/hr average, that's roughly $260,000 wasted per month - from data that can't reach compute fast enough.
| Metric | Figure | Source |
| Enterprise data that is unstructured | 80-90% | Gartner, 2024; IDC StorageSphere, 2024 |
| Orgs lacking AI-ready data practices | 63% | Gartner Newsroom, Feb 2025 |
| GPU time lost to I/O wait, enterprise clusters | ~40% | Run:ai GPU Utilization Report, 2025 |
| AWS egress cost per GB (outbound internet) | $0.09/GB | AWS Pricing, verified Q1 2025 |
| Cost to transfer 1 PB (egress only) | ~$90K | Derived from AWS list pricing |
| Unstructured data growth, year-over-year | 55-65% | IDC Global DataSphere Forecast, 2025 |
Hyperscalers can't. Data gravity is their moat. An AWS product that makes it trivially cheap to move data to Azure is corporate self-harm. Multicloud and hybrid orchestration will never come from a cloud provider; the market structurally requires a neutral party.
Legacy storage vendors won't. NetApp and Dell EMC manage individual clusters well. Their architecture assumes data lives somewhere specific and compute comes to it. A decade of managing installed base has left them unable to pivot fast enough.
Startups are solving the wrong problem. Most are either racing S3 to zero on object storage pricing, or building data catalogs - metadata tools that tell you where data lives without fixing the latency of getting it. Knowing where the food is isn't the same as being able to eat it.
In conventional storage, metadata and data are bound together. The record of where a file lives is stored with the file itself. Any global operation requires touching the actual data. At petabyte scale across a dozen environments, this is why everything is slow.
Hammerspace maintains a globally distributed metadata plane independent of where data physically sits. The result: a single coherent namespace spanning every storage environment - on-premise, cloud, archive - without moving anything. From any compute environment, data appears local.
The right analogy is BGP: the routing protocol that makes the internet work across thousands of independent networks without owning any of them. BGP didn't look like a business until everything ran on top of it. Hammerspace is building the equivalent coordination layer for enterprise data: neutral, invisible, and load-bearing for every AI workload above it.
The execution moat is real. Building a metadata system that stays consistent across heterogeneous environments, at millisecond latency, while handling fifteen years of enterprise infrastructure debt requires implementation-level knowledge of distributed file systems. The founding team built the original NFS protocol.
The namespace gets more valuable with every environment registered. Unlike most infrastructure software, this is a genuine network effect: the map becomes more complete, switching costs compound, and the competitive gap widens with every customer added. Hyperscalers are structurally prevented from replicating it. Open source can copy the concept but not the years of certified interoperability with dozens of storage environments.
If the global namespace becomes infrastructure, the position that follows is extraordinary: every AI workload touching unstructured enterprise data runs through this layer. Pricing power, expansion into policy enforcement and governance, and a moat that deepens with scale. TCP/IP didn't look like a business until everything depended on it.
Everyone is watching the model race. The more interesting company is building the road the models have to run on.
Sources
Published by SPLY Capital · March 2026