Synthi
Decentralized Synthetic Data Marketplace
The problem Synthi solves
The world of AI runs on data, but the best data is a toxic asset: incredibly valuable but too dangerous to touch. This creates a set of critical, expensive problems for every company trying to innovate today.
- The Privacy Paradox
To create "safe" synthetic data, companies are forced to send their most sensitive, private customer information (health records, financial data) to a centralized third-party vendor.
The Pain: This creates a catastrophic security risk: you must expose your most valuable asset in order to protect it. A single data breach at the vendor can lead to crippling fines, lawsuits, and a complete loss of customer trust.
- The Quality Gamble
The current process is built on blind trust. Companies must pay large sums of money upfront and hope the synthetic data they get back is actually useful.
The Pain: This turns every AI project into an expensive gamble. There is no independent, verifiable proof of quality before payment. If the data is bad, the project is delayed for months and the money is wasted fighting for a refund.
- The Innovation Bottleneck
Because of these risks, the most valuable data in the world (in hospitals, banks, and research labs) remains locked away in silos, completely unusable.
The Pain: This directly stifles the pace of innovation. Companies are forced to make an impossible choice:
Option A: Risk a massive data breach.
Option B: Use poor-quality data and build mediocre AI.
Option C: Do nothing and get left behind.
In a Nutshell:
We are eliminating the impossible choice that companies face today: sacrifice privacy for innovation, or sacrifice innovation for privacy. Our platform allows them to do both, securely and efficiently.
Challenges we ran into
- The Resource Challenge: Running Billion-Parameter Models on a Budget
The Problem: State-of-the-art generative models (like Gemma, Llama) are incredibly powerful but demand massive GPU memory (VRAM) and compute power. Building a profitable service on top of them seemed economically infeasible without huge infrastructure costs.
Our Solution: We engineered a highly optimized AI stack from the ground up. By combining Unsloth, 4-bit quantization, and LoRA fine-tuning, we reduced the memory footprint of our models by over 70%. This breakthrough is the technical foundation of our competitive pricing and high-margin business model.
- The Integration Challenge: Bridging the AI & Web3 Divide
The Problem: Blockchains are deterministic and can't run complex, probabilistic AI models. This created a huge hurdle: How could our on-chain smart contract trust the result of an off-chain quality check? This is the classic "AI Oracle Problem."
Our Solution: We developed a "Proof of Quality" system. The complex AI verification runs off-chain; the resulting QA report is published to IPFS, and its cryptographic hash is submitted to the blockchain. The smart contract only needs to verify this lightweight, tamper-proof hash to trigger payment or refund, creating a secure, trustless bridge between the two worlds.
- The Usability Challenge: Abstracting Web3 Complexity
The Problem: The target users—data scientists and enterprise clients—are not crypto-native. The friction of managing wallets, gas fees, and signing transactions would have been a major barrier to adoption.
Our Solution: We designed a hybrid architecture with a familiar Web2 user interface. Our front-end handles user requests seamlessly, while the backend orchestrates the complex Web3 interactions (escrow, verification, and settlement) automatically. This provides the user with a simple, intuitive experience while retaining the full security and transparency benefits of a decentralized backend.
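The "over 70%" memory reduction in the first challenge follows directly from quantization arithmetic. A minimal back-of-the-envelope sketch (the 9B parameter count is a hypothetical Gemma-class example; this counts weight storage only and ignores activations, KV cache, and LoRA adapter overhead):

```python
def vram_gb(n_params: float, bits_per_param: float) -> float:
    """Approximate weight-memory footprint in GB for a model whose
    n_params parameters are each stored at bits_per_param bits."""
    return n_params * bits_per_param / 8 / 1e9

# Hypothetical 9-billion-parameter model.
params = 9e9
fp16 = vram_gb(params, 16)  # 16-bit baseline: 18.0 GB of weights
int4 = vram_gb(params, 4)   # 4-bit quantized:  4.5 GB of weights

reduction = 1 - int4 / fp16
print(f"fp16: {fp16:.1f} GB, 4-bit: {int4:.1f} GB, reduction: {reduction:.0%}")
# fp16: 18.0 GB, 4-bit: 4.5 GB, reduction: 75%
```

Going from 16-bit to 4-bit weights alone yields a 75% reduction, consistent with the claimed figure; LoRA then keeps fine-tuning cheap by training only small adapter matrices on top of the frozen quantized weights.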
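The "Proof of Quality" flow from the second challenge can be sketched in a few lines. This is an illustration only: the function names are hypothetical, plain SHA-256 stands in for a real IPFS CID (which uses multihash encoding), and the on-chain side is modeled as a simple hash comparison rather than actual contract code:

```python
import hashlib
import json

def report_hash(report: dict) -> str:
    """Off-chain: canonically serialize the QA report and hash it.
    The report itself goes to IPFS; only this digest goes on-chain."""
    canonical = json.dumps(report, sort_keys=True, separators=(",", ":")).encode()
    return hashlib.sha256(canonical).hexdigest()

def contract_verify(submitted_hash: str, expected_hash: str) -> str:
    """On-chain: compare two digests -- a lightweight, tamper-evident
    check that decides between payment and refund."""
    return "release_payment" if submitted_hash == expected_hash else "refund_buyer"

qa_report = {"dataset_id": "demo-001", "fidelity_score": 0.93, "privacy_checks_passed": True}
committed = report_hash(qa_report)

# Untampered report: digests match, escrow releases payment.
assert contract_verify(report_hash(qa_report), committed) == "release_payment"

# Any tampering changes the digest and flips the outcome to a refund.
tampered = dict(qa_report, fidelity_score=0.99)
assert contract_verify(report_hash(tampered), committed) == "refund_buyer"
```

The design choice is that the chain never sees the data or the model, only a 32-byte commitment, which is what makes the heavyweight AI verification compatible with a deterministic smart contract.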
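The escrow-and-settlement flow the backend orchestrates in the third challenge reduces to a small state machine. A minimal sketch with hypothetical names, standing in for the on-chain contract the user never has to touch directly:

```python
from enum import Enum, auto

class Escrow:
    """Sketch of the settlement lifecycle: fund -> verify -> settle."""

    class State(Enum):
        FUNDED = auto()
        SETTLED = auto()
        REFUNDED = auto()

    def __init__(self, amount: int):
        self.amount = amount
        self.state = Escrow.State.FUNDED  # buyer's payment locked up front

    def settle(self, quality_ok: bool) -> str:
        """Called once by the backend after the Proof of Quality check."""
        if self.state is not Escrow.State.FUNDED:
            raise RuntimeError("escrow already settled")
        if quality_ok:
            self.state = Escrow.State.SETTLED
            return "paid_provider"
        self.state = Escrow.State.REFUNDED
        return "refunded_buyer"

escrow = Escrow(amount=100)
print(escrow.settle(quality_ok=True))  # paid_provider
```

Because the backend drives these transitions, the user sees an ordinary Web2 checkout while funds are still held and released trustlessly underneath.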
Tracks Applied (1)
Ethereum Track
ETHIndia
