licensingacquisitionai training datamergerJune 26, 2026

OpenAI–Getty Ink Data Pact Amid $3.7B Shutterstock Merger

OpenAI secures licensed visual archives for ChatGPT as Getty Images leverages the deal to clear its $3.7B merger path.

Getty Images has secured a landmark multi-year display and licensing partnership with OpenAI, a move that has effectively re-rated the visual data giant’s market value and cleared the regulatory fog surrounding its pending $3.7 billion merger of equals with Shutterstock. The agreement, which integrates Getty’s authenticated visual libraries directly into ChatGPT’s search and discovery experiences, triggered a dramatic recovery in Getty’s share price, which rallied from a low of 58 cents to $1.29 within days of the announcement. For data asset investors, the deal represents more than a commercial win; it is a structural validation of Irreplaceable Content Assets (ICAs) as the primary collateral for the next generation of generative search.

The $3.7 Billion Consolidation Play

The partnership serves as the financial and regulatory linchpin for the disclosed $3.7 billion combination of Getty and Shutterstock, a deal that recently secured conditional approval from the UK’s Competition and Markets Authority (CMA). To satisfy antitrust concerns, Shutterstock has agreed to divest its entire global editorial business, including brands like Rex Features and Splash News. By securing a high-margin licensing stream from OpenAI, the combined entity aims to de-lever its balance sheet and escape the debt constraints that have plagued the stock-photo sector. This consolidation confirms that in the AI era, scale is no longer measured by volume alone, but by the legal purity and "machine-readability" of the underlying dataset.

Google’s Aggressive Publisher Pilot

While Getty embraces licensing, Google is reportedly tightening the screws on news organizations. The search giant is currently pitching a broadened pilot program for Google News that would grant the company all-encompassing rights to use publisher content for AI model training. According to reports, publishers who decline to participate risk losing access to payments from Google’s older "Showcase" program, which the company plans to sunset. This "all-or-nothing" approach highlights the growing power imbalance in the data market, where hyperscalers are leveraging their distribution dominance to secure training data at rates that some publishers have described as pausingly low.

The Rise of Revenue-Share Marketplaces

In contrast to the flat-fee licensing model, Perplexity AI has launched a formal Publisher Program that introduces a performance-based payout structure. The company has reportedly set aside a disclosed $42.5 million payout pool for its "Comet Plus" initiative. Under this model, Perplexity offers an estimated 80/20 revenue split, where publishers receive the lion's share of subscription revenue when their content is cited by the AI assistant. This shift toward granular, citation-based monetization is gaining traction among mid-tier publishers like Der Spiegel, Fortune, and Time, who see it as a more sustainable alternative to the one-off licensing deals favored by OpenAI and Google.

Government and Industrial Data Demand

The market for specialized data assets is also expanding into the public sector. NASA has recently broadened its Commercial Satellite Data Acquisition (CSDA) initiative, awarding a contract vehicle with a maximum cumulative valuation of $476 million. This program streamlines the ingestion of commercial Earth observation data into a cloud-native clearinghouse, serving civil agencies and scientific institutions. Simultaneously, in the private sector, digital infrastructure firm Z Squared has entered a binding letter of intent to acquire Paradox Data, a specialist in high-density compute for data-intensive workloads, further illustrating the convergence of data ownership and processing power.

The Legal Stick: Suno and the $150k Penalty

The "carrot" of licensing is being driven by the "stick" of massive litigation. The Recording Industry Association of America (RIAA) continues to pursue claims against AI music startups Suno and Udio, seeking statutory damages of up to $150,000 per infringed work. While Warner Music Group has already disclosed a settlement and licensing partnership with Suno, Sony and Universal remain in active litigation. These cases are forcing AI developers to move away from "stream-ripping" and toward licensed versions of their models, effectively creating a two-tier market where licensed data is the only viable path for enterprise-grade AI applications.

Why it matters for data owners

For data owners, the Getty–OpenAI deal and the emergence of Perplexity’s payout pool signal that the "scraping era" is ending. High-quality, human-verified data is no longer a commodity; it is a strategic asset with a measurable ROI. As regulators in the EU move to implement transparency and watermarking obligations under the AI Act, the premium for licensed, indemnified data will only grow. Owners of proprietary archives now have multiple monetization paths—from multi-year flat-fee deals to RAG-based royalties—allowing them to treat their data as a high-yield financial instrument rather than just a byproduct of their primary business.

d-nvest turns the data assets behind these deals into scored, actionable opportunities.

Explore the pipeline →
OpenAI–Getty Ink Data Pact Amid $3.7B Shutterstock Merger | d-nvest