joint venturemedical dataai licensingbiotech fundingJune 19, 2026

SoftBank and Tempus AI Launch $200M Medical Data Joint Venture

The deal establishes a Tokyo-based entity to monetize genomic and clinical data assets for the Japanese market.

SoftBank Group Corp. has committed a disclosed $200 million investment into a new joint venture with Tempus AI to deploy a massive library of genomic and clinical data across the Japanese healthcare market. The deal, announced this week, marks a significant escalation in Masayoshi Son’s strategy to pivot the conglomerate toward specialized AI applications, specifically targeting the high-moat sector of medical data assets. The joint venture will be headquartered in Tokyo and will function as the exclusive vehicle for Tempus’s data-driven diagnostic technologies in the region.

Monetizing the 100-Petabyte Data Library

The core value proposition of the deal lies in the proprietary data architecture developed by Tempus AI. The company currently manages a library of approximately 100 petabytes of clinical and molecular data, which it uses to provide physicians with AI-generated insights for cancer treatment and other chronic conditions. By forming this joint venture, SoftBank is effectively licensing this data-processing engine to address the specific regulatory and linguistic requirements of the Japanese medical system. This follows Tempus AI's recent initial public offering, which raised $410 million and valued the company at approximately $6.1 billion.

For SoftBank, the $200 million injection is more than a capital play; it is an acquisition of regional data sovereignty. The venture will integrate Japanese patient data with Tempus’s existing algorithmic frameworks, creating a localized dataset that is expected to become the primary infrastructure for precision medicine in the country. Analysts note that this move mirrors SoftBank's broader trend of securing "Vertical AI" assets—data-rich companies that solve specific industry problems rather than general-purpose LLMs.

The Rise of Biological and Specialized Data Rounds

The SoftBank-Tempus deal arrives amidst a surge of funding for companies that treat data as a primary biological asset. Just 24 hours ago, EvolutionaryScale announced a $142 million seed round to develop AI models capable of designing new proteins. Led by former Meta researchers, the startup is leveraging massive biological datasets to create what they call "programmable biology." This highlights a shift in the investment landscape: capital is moving away from model-agnostic infrastructure and toward firms that own the underlying training data for high-value scientific domains.

Similarly, the hardware layer is being redesigned to handle these specific data flows. Etched.ai recently secured $120 million in Series A funding to build specialized chips optimized for Transformer-based models. As datasets grow in complexity—moving from text to multi-modal genomic and engineering data—the industry is seeing a convergence where data ownership and compute optimization must happen in tandem to maintain competitive margins.

Legal Precedents for Data Acquisition

As the market for data assets matures, the legal boundaries of data acquisition are being tested in major jurisdictions. A pivotal ruling was issued this week as a U.S. judge rejected Meta's attempt to stop Bright Data from scraping its public information. The court ruled that scraping publicly available data does not violate Meta's terms of service if the scraper is not a logged-in user. This decision provides significant legal cover for data marketplaces and AI firms that rely on large-scale web harvesting to build their training corpuses, potentially lowering the barrier to entry for new data-intensive startups.

However, the regulatory environment remains a double-edged sword. While web scraping saw a win, the security of enterprise data is commanding higher premiums. Vanta, a trust management platform, raised $150 million at a $2.45 billion valuation to automate compliance and data security. As companies like SoftBank and Tempus move sensitive medical records across borders, the demand for "clean" and compliant data pipelines is becoming the primary bottleneck for global data deals.

Why it matters for data owners

The SoftBank-Tempus partnership signals that the most lucrative exit for data owners is no longer a simple sale, but a structured joint venture that preserves the long-term value of the asset. For owners of proprietary datasets—whether in healthcare, engineering, or logistics—the market is shifting toward localized licensing models. By retaining equity in the regional entities that process their data, owners can monetize their assets repeatedly across different markets while navigating local sovereign data laws. This deal proves that in the AI era, the dataset is not just a commodity; it is the core capital of the enterprise.

d-nvest turns the data assets behind these deals into scored, actionable opportunities.

Explore the pipeline →