Skip to content
Validata

Validata

Powering AI Agents with Trustless, Rewarded Data.

Created on 1st March 2025

Validata

Validata

Powering AI Agents with Trustless, Rewarded Data.

The problem Validata solves

Another day, another gen AI model. It seems like the speed of development has sped up so quickly over the past couple of years and we have seen the constant war of how ChatGPT, Grok, Deepseek, and Claude progress and perform. These models have changed our lives, but have only been released recently and can only get better due to increased data and better model design.
Data is the true source behind AI’s power. However, AI agents in Web3 face a critical data bottleneck—developers struggle to access high-quality, real-time datasets. On the flip side, there are limited incentives for data providers to data, whether due to lack of direct rewards, expensive storage, or IP concerns. Existing data solutions are centralized, siloed, expensive, and often biased, limiting AI’s potential in decentralized ecosystems.
Some people, including artists, musicians, scientists, and other organizations, have information that could help advance the quality of AI models, but are hesitant to allow their work to train AI models out of lack of rewards to the original creators. As a result, AI models face scrutiny when their work emulates that created by humans and lack of diverse high-quality data slows down innovation and restricts development. Additionally, there is no trustless system to source, benchmark, and incentivize AI training data while ensuring security and fairness. Without one, AI innovation in Web3 will remain restricted, limiting the development of the on-chain AI future.
Validata tackles this issue by creating a decentralized data ecosystem where users contribute valuable datasets, developers train AI agents on these datasets, and a ZK-secured benchmarking system ensures fairness. Contributors and developers are incentivized to provide impactful data and achieve stronger performance on AI benchmarks, respectively. By leveraging Web3 infrastructure, Validata democratizes AI development while ensuring trust, privacy, and scalability.

User Interaction and Data Flow

Validata provides a seamless, decentralized experience where users contribute data, developers train AI agents, and smart contracts manage rewards through a transparent, ZK-secured process.

The project architecture and development process

How It Works

  1. Decentralized Data Contribution
    Users contribute structured datasets to the ecosystem through an intuitive Web UI. Unlike centralized platforms, Validata allows data providers to retain ownership and control of their contributions, ensuring that their work is recognized and fairly rewarded.
    Key Features:
    Story Protocol Integration – Every dataset is verified and tracked using NFTs minted via Story Protocol, ensuring on-chain IP attribution and enabling monetization.
    EthStorage Integration – Contributed datasets are stored immutably using decentralized storage solutions, ensuring security and accessibility.
    Direct Incentives – Users receive tokenized rewards based on the value and impact of their datasets on AI model training.
  2. AI Model Development and Deployment
    AI developers and researchers access the contributed datasets to train their models. Unlike traditional platforms, where data access is limited and expensive, Validata ensures that AI model development is permissionless and decentralized.
    Key Features:
    Permissionless Data Access – Developers browse and acquire datasets transparently.
    Benchmarking System – Instead of a centralized entity setting benchmarks, AI models are evaluated against user-defined performance metrics to ensure fairness and adaptability.
    Flow Smart Contracts – AI models are securely validated and registered through Flow-based smart contracts, ensuring immutable and trustless execution.

Key differentiators and uniqueness of the project

Key Differentiators
Incentivized High-Quality Data Contribution – Unlike traditional data-tracking solutions that merely record ownership, our system actively rewards users for providing valuable datasets. This not only ensures higher participation but also increases the quality and diversity of data available for AI training.
Self-Sustaining Data and AI Model Economy – Participants can earn rewards and reinvest them into AI model development or data acquisition, creating a self-funded ecosystem. This allows for sustainable AI innovation, reducing reliance on external funding or centralized control.
Beyond Data Tracking: A Full AI Development Lifecycle – While existing solutions focus solely on tracking and verifying data, we provide a complete infrastructure for AI model training, benchmarking, and deployment. Our ecosystem ensures that data is not just collected but also utilized in a transparent, trustless, and performance-driven manner.
Web3-Native Data and AI Model Ownership – Contributors retain ownership of their data through NFTs minted via Story Protocol, ensuring transparent attribution while enabling monetization and licensing. This is a significant improvement over centralized AI data repositories, where contributors have little to no control over how their data is used.
Permissionless Access and Cross-Chain Deployment – AI models trained on our platform can be deployed across multiple blockchains via Chainlink CCIP, expanding accessibility beyond a single ecosystem. Developers can leverage real-world data from various chains, enhancing AI models’ robustness and adaptability.
Transparent and Configurable Benchmarking – Instead of relying on opaque, centralized benchmarks, our platform allows users to define performance metrics and validate AI models against them. This ensures fairness, flexibility, and adaptability to different AI use cases, making the system transparent and performance-driven.

Trade-offs and shortcuts while building

Trade-offs and Shortcuts While Building
Developing Validata required a balance between delivering a functional product and optimizing for long-term innovation. Several trade-offs were made to prioritize usability and performance:
Optimizing for a Working Prototype – While many advanced features were envisioned, we focused on building a functional and scalable MVP first to ensure usability.
Prioritizing Core Infrastructure Over Additional Features – Some enhancements, such as more complex validation layers, had to be postponed to streamline development.
EVM vs. Non-EVM Chain Decisions – While EVM compatibility was a priority, we evaluated trade-offs to ensure Flow-based smart contracts were efficiently integrated for security and scalability.

Tracks Applied (7)

Most Killer App

Validata represents the ideal consumer-facing application for Flow, tackling the universal challenge of AI data accessib...Read More
Flow

Flow

Integrate Story in your DApp

Validata's integration with Story Protocol goes beyond basic IP tracking to create a comprehensive data rights managemen...Read More
Story

Story

Best Hook Design

The DatasetNftHook contract shows the innovative capabilities of Silo's hook system, it introduces a new way stake data ...Read More
Silo Finance

Silo Finance

Build a Decentralized Content Management System Using Web3:// and EthStorage

Validata uses EthStorage to create a decentralized content management system for AI training data. The implementation go...Read More
EthStorage

EthStorage

Best Use of CDP SDK

We use the coinbase wallet agent as well as on chain kit for a seamless bridge-to-web3 experience. It was important to u...Read More
Coinbase Developer Platform

Coinbase Developer Platform

Best use of Chainlink CCIP

Our use of Chainlink CCIP enables cross-chain AI agent deployment and data transfer. Through the CrossChainData contract...Read More
Chainlink

Chainlink

INFRASTRUCTURE + SCALABILITY

Validata is infrastructure for empowering decentralized AI development. Competition breeds progress, and right now there...Read More

Discussion

Builders also viewed

See more projects on Devfolio