infrastructureai fundingdata licensingJune 19, 2026

KKR and ECP Strike $50B Partnership for AI Data Infrastructure

Strategic alliance targets the massive data-power gap as generative AI scaling demands unprecedented compute capacity.

KKR and Energy Capital Partners (ECP) have finalized a disclosed $50 billion (https://www.kkr.com/news-details/kkr-and-ecp-announce-50-billion-strategic-partnership-to-accelerate-data-center-and-power-infrastructure-for-ai) strategic partnership to accelerate the development of data centers and power infrastructure globally. This massive capital commitment is designed to address the "data-power gap" that has become the primary bottleneck for scaling generative AI assets. By combining KKR’s global infrastructure expertise with ECP’s specialized focus on energy transition and power generation, the alliance aims to provide the stable, high-capacity environments necessary for the largest frontier models to operate. This move signals that the market for data assets is no longer just about the information itself, but about the physical and energetic sovereignty required to process it at scale.

The Convergence of Data and Power

The deal arrives at a moment when the AI industry is grappling with the sheer physical requirements of data processing. Hewlett Packard Enterprise (HPE) and Nvidia recently launched a joint venture, "Nvidia AI Computing by HPE," (https://www.hpe.com/us/en/newsroom/press-release/2024/06/hewlett-packard-enterprise-and-nvidia-announce-nvidia-ai-computing-by-hpe.html) which integrates private cloud data capabilities directly with Blackwell-era compute. This private-cloud trend, mirrored by the KKR-ECP partnership, suggests a shift away from public cloud reliance toward bespoke, asset-heavy data environments. Large-scale data owners are increasingly seeking to co-locate their proprietary datasets within these specialized infrastructures to minimize latency and ensure data security, a trend that KKR and ECP are positioning themselves to capitalize on with their multi-billion dollar build-out.

Data Licensing and the Publisher Pivot

While infrastructure scales at the multi-billion dollar level, the fight for high-quality training data is entering a more formalized licensing phase. Perplexity AI has officially launched its "Publishers Program," (https://www.theverge.com/2024/6/18/24181287/perplexity-ai-publishers-program-revenue-sharing) a revenue-sharing model designed to compensate media organizations for the use of their data in AI-generated answers. This follows intense scrutiny over data scraping practices and represents a growing industry standard where data is treated as a licensed asset rather than a public resource. Similarly, Sony Music Group has issued a formal warning to over 700 tech firms (https://www.reuters.com/technology/sony-music-warns-tech-firms-over-unauthorized-ai-use-2024-05-16/) against the unauthorized use of its massive catalog for AI training, further tightening the global supply of premium, legally-cleared datasets.

Funding the Next Frontier of Data Safety

Investment is also flowing into the safety and governance layers of the data economy. Ilya Sutskever, the former chief scientist of OpenAI, has launched Safe Superintelligence Inc. (SSI) (https://ssi.inc/), a new venture dedicated solely to the development of safe, powerful AI through rigorous, data-intensive safety research. While the initial funding amount remains undisclosed, the venture's focus on "safety-first" data training protocols highlights a new premium on datasets that are not only large but curated for alignment. This coincides with the European Commission's latest guidelines on high-risk AI data classification (https://digital-strategy.ec.europa.eu/en/policies/regulatory-sandboxes-ai), which will force developers to maintain stricter provenance records for any data asset used in the EU market.

Strategic Industrial Data Expansion

Beyond the consumer web, industrial and manufacturing data assets are seeing significant capital inflows. Bright Machines recently secured $126 million in Series C funding (https://techcrunch.com/2024/06/18/bright-machines-raises-126m-to-automate-electronics-manufacturing/) to scale its software-defined manufacturing platform. The company’s ability to generate and leverage proprietary manufacturing data is a prime example of how vertical-specific datasets are becoming high-value investment targets. As general-purpose web data becomes commoditized or restricted by licensing walls, the focus of the data-asset market is shifting toward these specialized, high-fidelity industrial streams that can power autonomous factories and supply chains.

Why it matters for data owners

For data owners, the KKR-ECP partnership and the rise of programs like Perplexity’s signal a dual-track monetization path. First, the value of "clean" data—legally licensed and safety-aligned—is skyrocketing as major players seek to avoid the litigation risks highlighted by Sony Music. Second, the massive investment in physical infrastructure means that data owners who can offer "data-plus-compute" packages, or who own data assets located in energy-secure jurisdictions, will command a significant premium. We are moving from a world of data abundance to one of high-quality data scarcity, where the infrastructure to process that data is as valuable as the data itself.

d-nvest turns the data assets behind these deals into scored, actionable opportunities.

Explore the pipeline →
KKR and ECP Strike $50B Partnership for AI Data Infrastructure | d-nvest