Skip to content
DataDaddy

DataDaddy

Own your data. Prove it. Get Paid.

Created on 15th March 2026

DataDaddy

DataDaddy

Own your data. Prove it. Get Paid.

The problem DataDaddy solves

The global AI industry is starving for high-quality training data — but the existing pipeline is broken at every layer.
Companies pay data brokers thousands of dollars for stale, unverified datasets with no provenance and no accountability. Individual contributors generate enormous amounts of valuable data every day and receive nothing in return. And when labelling is needed, centralised annotation farms are opaque, expensive, and trivially gameable.
Data Daddy fixes all three simultaneously. We built a decentralised data marketplace on Base where companies post requests, verified contributors earn crypto for submitting responses, and a staking-based labelling pool enforces quality through economic incentives — not trust.
The labelling mechanism is the core insight: labellers stake ETH to submit a label, and only the majority consensus wins the pool. Bad actors don't just get ignored — they lose money. This makes dataset quality a direct function of financial skin in the game, removing the need for any centralised arbiter.
Privacy is enforced at the identity layer using Anon Aadhaar ZK proofs, letting contributors prove they are real, verified Indian citizens on-chain without revealing any personal data. Companies get demographic confidence; contributors keep their privacy.
The result is a trustless, permissionless data economy — where data quality is cryptographically enforced, contributor earnings are automatically settled on-chain, and no single entity controls the pipeline.

Challenges we ran into

The smart contract architecture was the first thing that humbled us. What seemed like a clean three-contract design on paper kept shifting as we actually built it — the staking and consensus logic in LabellingPool went through several rewrites before we stopped introducing new edge cases faster than we fixed old ones. Solidity has a way of making you feel stupid in very specific and educational ways.
Full-stack integration was death by a thousand async calls. Every layer worked fine on its own. The ZK proofs verified. The contracts deployed. The frontend connected to the wallet. And then you try to get all of it talking to each other — proof output feeding into contract params, transaction state syncing with UI, wagmi doing wagmi things — and suddenly nothing works and the error message is just undefined. A lot of hours went into that particular void.
Time was the invisible third constraint sitting on top of everything else. We had to make real calls about what to cut, what to stub, and what to ship half-finished. Some of those calls were right. Some we're still not sure about.
What we're proud of is that the core loop works — a company can request data, a contributor can submit it, and the labelling pool settles on-chain. That's the thing we wanted to prove, and we proved it.

Tracks Applied (4)

Privacy

Data Daddy is built natively on Base, and privacy is the reason Base was the right chain for it. The problem we're solv...Read More
Base

Base

BEST Overall Project

Data Daddy solves a problem at the intersection of every major theme in Web3 right now — privacy, data ownership, crypto...Read More

Build What Big Tech Won't

Data Daddy uses Fileverse exactly the way this track was designed for — as the privacy layer that makes the whole system...Read More
Fileverse

Fileverse

Privacy

Most Web3 applications talk about privacy. Data Daddy is one of the few where privacy is the only reason the product wor...Read More

Cheer Project

Cheering for a project means supporting a project you like with as little as 0.0025 ETH. Right now, you can Cheer using ETH on Arbitrum, Optimism and Base.

Discussion

Builders also viewed

See more projects on Devfolio