Dataset opportunity

Virta — Knowledge Base Dataset Opportunity

Large knowledge base dataset held by Virta, usable for Document Intelligence and RAG.

Knowledge Base DatasetTextDocument Intelligence🌍 Finlandvirta.globalJun 15, 2026

80/ 100

☆ Sign in to save

Score

79.9

Confidence

92%

Action

Data Sharing Agreement

Market

Global Intelligent Document Processing market size was valued at USD 2.3 billion in 2024 and is projected to grow at a CAGR of 24.7% between 2025 and 2034. [2]

External signals

Sourced by 5 recent signals

Recent dated external facts that triggered this opportunity — auditable provenance.

📰press2026-06-15
Avec Thales, Renault Group renforce sa présence sur le marché de la défense
journalauto.com ↗
📰press2026-06-12
Les équipementiers automobiles appellent à un renforcement de l’Industrial Accelerator Act
journalauto.com ↗
📰press2026-06-12
Chery France muscle sa direction pour soutenir son développement commercial
journalauto.com ↗
📰press2026-06-12
La Belgique approuve à son tour le système de conduite autonome de Tesla
journalauto.com ↗
📰press2026-06-12
Cédric Lacour et Gaël de Beauchesne, premières recrues de GAC Motor France
journalauto.com ↗

Lineage

How this lead was derived

The signal-first chain, end to end: recent external signals → qualified niche → resolved data-holder → site verification → scored opportunity. Every lead is explainable.

Data appetite

1 signals

Concrete evidence this company actively cares about data — why it's ripe for the deal room.

🔌Public API
Public Virta API for charging network management and data integration
source ↗

Profile

Dataset profile

Type

Knowledge Base Dataset

Modality

Text

Sector

mobility

Volume

Large

Freshness

Real-time

Rarity

High (proprietary)

Accessibility

Partial

Legal

Mixed ownership — GDPR-sensitive (PII review)

Buyer persona

Document-AI / IDP vendors

Virta holds a comprehensive Knowledge Base dataset in Text modality, derived from its extensive electric vehicle charging platform operations. This includes technical documentation, API guides, support articles, and operational procedures, making it a prime asset for training a Document Intelligence AI. Such an AI could automate customer support, enhance developer onboarding, and extract insights to streamline platform management.

The global Intelligent Document Processing market, a proxy for this use case, was valued at $2.3 billion in 2024 and is projected to grow at a 24.7% CAGR between 2025 and 2034. [2] Despite access complexities such as shared data ownership with Charge Point Operators and high GDPR sensitivity due to driver data, the dataset's value is immense. Its unique specificity to the EV charging domain provides a rare opportunity to build a highly specialized and valuable AI model, justifying the effort to navigate the necessary anonymization and consent frameworks. ⚠ Diligence (valuable data, access to negotiate): Data ownership is shared with Charge Point Operators (CPOs) using the platform.; High GDPR sensitivity due to EV driver location and charging habits.; Requires complex anonymization of individual charging sessions and payment records.; Northe subsidiary collects direct vehicle telemetry via OBDII which may have different consent terms. · corporate: independent.

Scoring

Scored dimensions

Explainable, evidence-based dimensions (0–100). The radar shows the investment axes.

Market read

This evidence collectively proves Virta owns a comprehensive, proprietary knowledge base covering the complex electric vehicle (EV) charging ecosystem. This dataset includes technical API documentation, product guides, changelogs, and support articles. For Document AI and Intelligent Document Processing (IDP) vendors, this is a rare source of domain-specific text essential for training models to understand the mobility sector's unique document formats. In a market projected to grow at over 24% annually, this dataset offers a significant competitive advantage for building next-generation document intelligence solutions.

See dimension details ↓

Dataset Freshness82
real-time/streaming
Training Value84
fit for Document Intelligence
Dataset Specificity100
dominant 'knowledge_base', sector mobility, 4 specific types
Dataset Rarity70
proprietary domain data (open lowers rarity)
Dataset Volume100
24 evidence hits, explicit data-volume mention
Buyer Demand85
The demand is driven by two converging, high-growth markets: the AI in Mobility market, projected to grow at a 44.6% CAGR (2026-2035), and the Intelligent Document Processing (IDP) market, a proxy for Document Intelligence, which is growing
Legal Accessibility60
open/API access
Acquisition Feasibility68
high difficulty, independent
Evidence Strength100
9 evidence types, 24 hits
Right to License28
ownership=mixed, licensing=gdpr_sensitive
Corporate Independence90
independent
Data Orientation39
1 data-appetite signals (1 types)
Dormant Data Surplus92
surplus=high, 5 recent external signals — proprietary data beyond what's already monetised
ICP Audit75
⚠ review — The company's core business is selling an EV charging management platform (SaaS) and derived intelligence/analytics via APIs, which is a form of selling intelligence, making it a bad fit. Issues: The company's core product is a Charge Point Management System (CPMS) called Virta Hub, which is a software platform for businesses to operate EV charging netwo; Virta explicitly offers 'Data access & analytics' and a suite of APIs for customers to integrate Virta's data and functionalities i

Evidence

Dataset evidence & lineage

What the typed evidence proves the company holds — reframed for clarity and set against the market.

Downloads / exports

Tabular · 2 hits

This indicates a collection of structured product communications and support materials, such as release notes, which are ideal for training models on product updates and customer-facing documents.

virta.global ↗virta.global ↗

Event streams

Time Series · 2 hits

This points to documentation describing real-time data protocols like OCPP, which is essential for training AI to understand technical specifications for IoT and mobility data streams.

virta.global ↗virta.global ↗

Industrial data

Time Series · 2 hits

This shows documentation exists for complex industrial use cases, including enterprise system integration (ERP, CRM) and energy management, a high-value niche for specialized document AI.

virta.global ↗virta.global ↗

API access

Multimodal · 1 hit

This proves the existence of structured documentation detailing core platform capabilities, valuable for training models to parse API specifications and technical feature lists.

virta.global ↗

Knowledge base / docs

Text · 1 hit

This is direct evidence of a centralized repository of technical knowledge, including guides and changelogs, representing a goldmine for training language models on complex support articles.

virta.global ↗

Developer portal

Multimodal · 1 hit

This confirms a formal, well-structured portal with extensive API documentation, providing high-value, real-world content for training models to understand technical developer guides.

virta.global ↗

Data-volume signal

Multimodal · 1 hit

This sample describes data access policies and analytics integration, providing text useful for training models to understand data governance and usage instructions within user guides.

virta.global ↗

IoT / sensor data

Time Series · 1 hit

This is evidence of documentation explaining the company's IoT data infrastructure, crucial for training models to understand the context of connected device data in technical manuals.

virta.global ↗

Geospatial data

Tabular · 1 hit

This indicates the presence of documentation related to geospatial analytics, a specialized domain for document intelligence models focused on location-based services and logistics.

virta.global ↗

Marketplace

Dataset details

Trustedby Virta →

Geographic coverage

Global

Time range

Real-time

Update frequency

Real-time

Delivery

API

Formats

Text

License

One-time license for internal use, AI training, and model development. Restrictions may apply to redistribution.

Personal data

No PII

From EUR 130,000· licence· final price on request

Indicative estimate, derived from public signals — not a quote, not contractual, and not agreed with the company. Is this your company? Correct it.

This proprietary, large-volume text dataset from Virta's EV charging platform is highly valuable for Document Intelligence due to its domain-specific technical and operational content. The strong growth in the Intelligent Document Processing market indicates significant demand.

Proprietary Automotive Technical Manuals — €90,000 - €150,000Large-scale Customer Support Knowledge Bases (SaaS) — €70,000 - €130,000

Detailed schema & sample available on access request.

Want this data?

Request access — we broker a secure deal room. Operator-reviewed, no automatic sharing.

Share this opportunity

This listing was generated automatically from public signals. It is not verified, and we are not affiliated with this company.

Coverage

Scanned sources

https://www.virta.globalingested

https://www.virta.global/virta-apiingested

https://www.virta.global/northe/mileage-reportingingested

https://www.virta.global/companyingested

https://www.virta.global/use-cases/ev-charging-hubsingested

https://www.virta.global/use-cases/heavy-dutyingested

https://www.virta.globalinferred

Deliverable

Premium dataset report

Virta Knowledge Base — a Large knowledge base dataset (Text modality) in the mobility domain. Primary AI use-case: Document Intelligence. Market signal: Global Intelligent Document Processing market size was valued at USD 2.3 billion in 2024 and is projected to grow at a CAGR of 24.7% between 2025 and 2034. [2]. Investment score 79.9/100 (confidence 0.92). Recommended action: Data Sharing Agreement.

Teaser is public · premium is locked behind access.

From the marketplace

Explore live data opportunities

mobility

Fleets Enterprises — Maintenance Logs Dataset Opportunity

View opportunity →mobility

Voltatrucks — Mobility Telemetry Dataset Opportunity

View opportunity →mobility

Transition One — Mobility Telemetry Dataset Opportunity

View opportunity →

Document41 Mobility Telemetry Dataset52 Knowledge Base Dataset15 Mobility & Geospatial Dataset4 Mobility Event Dataset3 Mobility158

Browse datasets by sector & use-case

Data Academy

Learn before you deal

All guides

News & Insights

Latest from the briefing

All briefings