September 17, 2025 in Executive Edge

Composable or Collapse? The Infrastructure Decision Facing Every AI Leader

SHARE: PRINT ARTICLE:print this page https://doi.org/10.1287/LYTX.2025.04.08

The rise of artificial intelligence (AI) is reshaping entire industries, from creative applications powered by generative models to predictive analytics, transforming how business decisions are made. But behind this revolution is a rapidly emerging crisis: Legacy infrastructure is no longer enough.

According to the 2025 State of AI Infrastructure Report, 44% of IT leaders cite infrastructure limitations as the primary barrier to scaling AI. Meanwhile, more than 5,400 U.S. data centers are already strained under increasing workloads. The message is clear: AI’s evolution is outpacing the architecture built to support it.

Meeting the escalating compute and memory demands of modern AI requires more than incremental upgrades – it calls for a fundamental shift in infrastructure. Compute Express Link (CXL) is emerging as a vendor-neutral solution that delivers the scalability, performance and efficiency needed to support AI’s continued evolution.

AI Workloads Break Traditional Infrastructure

AI workloads are fundamentally different from conventional applications. Model training and inference tasks are both memory- and compute-intensive. As AI systems grow in size and complexity, the underlying infrastructure must handle massive data throughput with low latency to support the real-time processing.

Key challenges accelerating the breakdown of today’s architectures:

  • Data growth is exploding: 83% of organizations anticipate AI workload expansion in the next 12 months, and over half expect data needs to rise by more than 25%, fueled by unstructured sources such as video, speech and sensor input.
  • Underinvestment is widening the gap: Although 94% of companies say they’re confident in their AI road maps, only 17% are planning more than three years ahead. This lack of foresight leaves most enterprises unprepared for AI’s infrastructure demands.
  • Memory bottlenecks are the hidden limit: Current systems heavily depend on dynamic random access memory (DRAM) modules directly attached to CPUs. Because this memory capacity is dedicated and cannot be shared across GPUs, users end up deploying more memory than is truly needed at any point in time, raising the total cost of ownership (TCO). Worse, local memory capacity and bandwidth limit performance when models require more memory than is locally available.
  • Capital costs are unsustainable: McKinsey estimates that by 2030, global data center investments will hit $7 trillion – $5.2 trillion of which will be spent solely on AI-related infrastructure. Simply adding more hardware isn’t sustainable from a cost or energy perspective.

CXL: A New Foundation for AI-Centric Architecture

CXL is a high-speed interconnect standard designed to enable low-latency, high-bandwidth communication between CPUs, GPUs, accelerators and memory. Unlike traditional architectures, CXL enables memory to be shared across multiple devices – dynamically, efficiently and coherently.

Here’s how CXL redefines AI infrastructure:

  • Elastic memory pooling: CXL allows memory to be pooled and shared across heterogeneous compute elements. Rather than overprovisioning memory for each processor, enterprises can dynamically allocate capacity based on real-time workload needs – improving efficiency and cutting costs.
  • Composability with CXL 3.2: The latest CXL specification enhances memory device monitoring, enables smarter tiering with the Hot-Page Monitoring Unit and improves security through the Trusted Security Protocol. These upgrades make it easier to disaggregate and reassemble compute, memory and storage on demand – strengthening the modularity and flexibility of composable infrastructure.
  • Solving the memory wall: With traditional DRAM, adding memory means adding CPUs or system boards, which consume unnecessary cost, power and space. CXL enables high-capacity memory modules to be accessed by multiple devices without hardware duplication, breaking the memory wall that limits AI model performance.
  • Efficiency drives sustainability: By enabling right-sized infrastructure and minimizing idle memory, CXL reduces energy waste. As AI’s power draw becomes a growing concern, especially in large-scale deployments, technologies like CXL help organizations meet sustainability goals without sacrificing performance.

Strategic Advantages for Enterprises

CXL isn’t just about hardware – it’s a strategic enabler. Organizations that embrace it can outpace competitors and future-proof their operations. Here are four high-value use cases:

  1. Maximize memory utilization: With elastic memory pools, workloads can expand or contract based on actual usage. No more overprovisioning or idle DRAM.
  2. Eliminate stranded compute power: In legacy setups, compute resources can’t access memory beyond their node. CXL enables seamless memory access across devices, optimizing every GPU and accelerator.
  3. Adopt modular data center design: Composable architecture allows enterprises to build data centers like Lego blocks, scaling incrementally instead of overhauling entire systems.
  4. Cut carbon, not performance: Reducing duplicate memory resources and idle power lowers energy consumption – without compromising speed or performance.

Why Timing Matters

CXL is no longer experimental. It’s actively backed by leading chipmakers, cloud providers and system integrators, such as Intel, AMD, NVIDIA and Arm. Hyperscalers like Microsoft Azure and Google Cloud are integrating it into their next-gen systems.

The AI ecosystem is expanding beyond centralized cloud training. Inference is moving to the edge. Private data centers are playing a larger role in secure model deployment. This distribution increases pressure on infrastructure – and the cost of inertia.

Organizations that wait to adopt modern architectural standards risk losing competitive ground, facing higher costs and performance bottlenecks. Those that act now will build the infrastructure advantage necessary for long-term leadership in the AI era.

Lead the Shift, or Lag Behind

AI’s exponential growth is not just changing how data is processed – it’s redefining what infrastructure must look like. Traditional systems, built for static, compute-bound workloads, can no longer support the data-hungry, memory-intensive demands of AI.

CXL doesn’t just add efficiency – it fundamentally transforms how compute and memory interact. It unlocks composability, reduces TCO and allows for sustainable scaling. Most importantly, it positions enterprises to move as fast as the AI technologies they deploy.

Adopting CXL is more than a technical upgrade – it’s a business-critical decision. It equips enterprises and hyperscalers to do more with what they have, accelerate AI deployments and stay ahead of innovation curves.

The opportunity – and the urgency – are clear. Infrastructure has become a defining edge in AI. And for organizations that seize this moment, CXL will be more than a backbone. It will be a launchpad.

JB Baker

SHARE:

INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.