DataDaddy
Own your data. Prove it. Get Paid.
Created on 15th March 2026
•
DataDaddy
Own your data. Prove it. Get Paid.
The problem DataDaddy solves
The global AI industry is starving for high-quality training data — but the existing pipeline is broken at every layer.
Companies pay data brokers thousands of dollars for stale, unverified datasets with no provenance and no accountability. Individual contributors generate enormous amounts of valuable data every day and receive nothing in return. And when labelling is needed, centralised annotation farms are opaque, expensive, and trivially gameable.
Data Daddy fixes all three simultaneously. We built a decentralised data marketplace on Base where companies post requests, verified contributors earn crypto for submitting responses, and a staking-based labelling pool enforces quality through economic incentives — not trust.
The labelling mechanism is the core insight: labellers stake ETH to submit a label, and only the majority consensus wins the pool. Bad actors don't just get ignored — they lose money. This makes dataset quality a direct function of financial skin in the game, removing the need for any centralised arbiter.
Privacy is enforced at the identity layer using Anon Aadhaar ZK proofs, letting contributors prove they are real, verified Indian citizens on-chain without revealing any personal data. Companies get demographic confidence; contributors keep their privacy.
The result is a trustless, permissionless data economy — where data quality is cryptographically enforced, contributor earnings are automatically settled on-chain, and no single entity controls the pipeline.
Challenges we ran into
The smart contract architecture was the first thing that humbled us. What seemed like a clean three-contract design on paper kept shifting as we actually built it — the staking and consensus logic in LabellingPool went through several rewrites before we stopped introducing new edge cases faster than we fixed old ones. Solidity has a way of making you feel stupid in very specific and educational ways.
Full-stack integration was death by a thousand async calls. Every layer worked fine on its own. The ZK proofs verified. The contracts deployed. The frontend connected to the wallet. And then you try to get all of it talking to each other — proof output feeding into contract params, transaction state syncing with UI, wagmi doing wagmi things — and suddenly nothing works and the error message is just undefined. A lot of hours went into that particular void.
Time was the invisible third constraint sitting on top of everything else. We had to make real calls about what to cut, what to stub, and what to ship half-finished. Some of those calls were right. Some we're still not sure about.
What we're proud of is that the core loop works — a company can request data, a contributor can submit it, and the labelling pool settles on-chain. That's the thing we wanted to prove, and we proved it.
Tracks Applied (4)
Privacy
Base
BEST Overall Project
Build What Big Tech Won't
Fileverse
Privacy
Technologies used
Cheer Project
Cheering for a project means supporting a project you like with as little as 0.0025 ETH. Right now, you can Cheer using ETH on Arbitrum, Optimism and Base.
