ClariFi
Track Your Money, Shape Your Future.
The problem ClariFi solves
Managing personal finances is often messy, time-consuming, and stressful. People struggle with:
- Tracking multiple income sources and expenses: Manually juggling apps, spreadsheets, and bills.
- Digitizing and organizing receipts: Paper bills and invoices get lost or are hard to manage.
- Preparing reports for taxes or accounting: Time-consuming manual calculations and formatting.
- Understanding spending habits: Hard to get actionable insights to improve financial decisions.
How ClariFi helps:
- Provides a unified dashboard to see all finances in one place.
- Automates tedious tasks like bill digitization and report generation.
- Offers AI-driven insights for smarter money decisions.
- Makes finance management faster, safer, and stress-free, helping users focus on their work and goals instead of manual tracking.
Challenges we ran into
Our biggest challenge was the inconsistent accuracy of our OCR feature on diverse Indian receipts. Initial tests showed frequent errors in extracting fields such as the GSTIN and the total amount, caused by varied receipt formats and poor image quality. This was a critical issue, as inaccurate data would make the entire app unreliable.
To overcome this, we engineered a smarter, AI-driven solution by building a custom wrapper around the Gemini API:
1. Image Pre-processing
Our wrapper first runs a lightweight pre-processing step on every uploaded image. It automatically:
- Enhances contrast
- De-skews the receipt
- Crops it to the relevant area
This ensures Gemini receives the cleanest possible input for analysis.
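As a rough illustration of the contrast and cropping steps above, here is a minimal sketch in pure Python. It assumes the image is already a grayscale grid of 0-255 values; the names `stretch_contrast`, `crop_to_content`, and `ink_threshold` are hypothetical and not taken from our wrapper. De-skewing is left out because it needs an imaging library to do properly.

```python
from typing import List

# Assumption: a grayscale image as rows of pixel values in the range 0..255.
Image = List[List[int]]


def stretch_contrast(img: Image) -> Image:
    """Linearly rescale pixels so the darkest becomes 0 and the brightest 255."""
    lo = min(min(row) for row in img)
    hi = max(max(row) for row in img)
    if hi == lo:
        # Flat image: nothing to stretch, return a copy.
        return [row[:] for row in img]
    return [[round((p - lo) * 255 / (hi - lo)) for p in row] for row in img]


def crop_to_content(img: Image, ink_threshold: int = 128) -> Image:
    """Crop to the bounding box of pixels darker than ink_threshold."""
    rows = [y for y, row in enumerate(img) if any(p < ink_threshold for p in row)]
    cols = [
        x for x in range(len(img[0]))
        if any(row[x] < ink_threshold for row in img)
    ]
    if not rows or not cols:
        # No dark pixels found: leave the image uncropped.
        return [row[:] for row in img]
    top, bottom = rows[0], rows[-1]
    left, right = cols[0], cols[-1]
    return [row[left:right + 1] for row in img[top:bottom + 1]]


def preprocess(img: Image) -> Image:
    """Enhance contrast first so the crop threshold behaves consistently."""
    return crop_to_content(stretch_contrast(img))


# A washed-out 3x4 "receipt" with two darker pixels in the middle row.
sample = [
    [200, 200, 200, 200],
    [200, 100, 120, 200],
    [200, 200, 200, 200],
]
cleaned = preprocess(sample)
```

Stretching before cropping means a single fixed threshold can separate ink from paper even on faded photos.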
2. The Gemini Wrapper & Prompt Engineering
Instead of building a complex recognition model from scratch, we built a dedicated API wrapper. This wrapper:
- Crafts a detailed prompt instructing the Gemini vision model to act as a financial data extractor
- Specifically asks it to identify key fields such as "Total", "GSTIN", and "Vendor Name"
- Returns the findings in a structured JSON format
This approach is far more reliable than trying to parse raw text output.
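The prompt-and-parse idea above can be sketched as follows. This is an illustrative outline, not our production wrapper: the actual call to Gemini is omitted, and `build_prompt`, `parse_response`, the JSON key names, and the prompt wording are all assumptions. The GSTIN check only tests the 15-character shape of the number, not its checksum.

```python
import json
import re
from typing import Any, Dict, List

# Hypothetical JSON keys for the fields named in the text.
FIELDS: List[str] = ["vendor_name", "gstin", "total"]

# Structural GSTIN check: 2-digit state code, 10-character PAN,
# entity code, the letter Z, and a final check character.
GSTIN_RE = re.compile(r"^[0-9]{2}[A-Z]{5}[0-9]{4}[A-Z][1-9A-Z]Z[0-9A-Z]$")


def build_prompt(fields: List[str] = FIELDS) -> str:
    """Compose an instruction asking the vision model for JSON only."""
    keys = ", ".join(f'"{name}"' for name in fields)
    return (
        "You are a financial data extractor. Read the attached receipt image "
        f"and reply with a single JSON object containing the keys {keys}. "
        "Use null for any field you cannot read. Do not add commentary."
    )


def parse_response(text: str) -> Dict[str, Any]:
    """Pull the JSON object out of the model's reply and normalise fields."""
    start, end = text.find("{"), text.rfind("}")
    if start == -1 or end <= start:
        raise ValueError("no JSON object found in model reply")
    data = json.loads(text[start:end + 1])
    result: Dict[str, Any] = {name: data.get(name) for name in FIELDS}

    # Keep the GSTIN only if it has the expected shape.
    gstin = result["gstin"]
    if isinstance(gstin, str) and GSTIN_RE.match(gstin.strip().upper()):
        result["gstin"] = gstin.strip().upper()
    else:
        result["gstin"] = None

    # Accept totals such as "1,250.50" or 1250.5; anything else becomes None.
    try:
        result["total"] = float(str(result["total"]).replace(",", ""))
    except ValueError:
        result["total"] = None
    return result


# Example reply text, written by hand for illustration.
reply = (
    "Here is the data:\n"
    '{"vendor_name": "Sharma Stores", "gstin": "27aapfu0939f1zv", '
    '"total": "1,250.50"}'
)
fields = parse_response(reply)
```

Fields that fail validation come back as `None`, so the confirmation screen can show them as blank rather than display a value that is probably wrong.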
3. User Verification and Feedback Loop
After the wrapper receives the structured JSON data from Gemini, it populates a confirmation screen. This allows the user to:
- Quickly verify the extracted data and correct any mistakes before it is saved
- Provide a crucial feedback loop to refine prompts and image pre-processing for even better performance over time
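One simple way to turn user confirmations into a feedback signal is to count how often each field gets corrected. The sketch below is a hypothetical illustration (the `FeedbackLog` class and its method names are not from our codebase); a consistently high correction rate for a field would suggest that its prompt wording or the pre-processing needs attention.

```python
from collections import Counter
from typing import Any, Dict, List


class FeedbackLog:
    """Tallies which extracted fields users had to correct."""

    def __init__(self) -> None:
        self.reviewed = 0  # number of receipts confirmed by users
        self.corrections: Counter = Counter()  # field name -> times corrected

    def record(self, extracted: Dict[str, Any], confirmed: Dict[str, Any]) -> List[str]:
        """Compare extracted and confirmed values; return the changed fields."""
        changed = [
            name for name, value in confirmed.items()
            if extracted.get(name) != value
        ]
        self.reviewed += 1
        self.corrections.update(changed)
        return changed

    def correction_rate(self, field: str) -> float:
        """Fraction of reviewed receipts where this field was corrected."""
        if self.reviewed == 0:
            return 0.0
        return self.corrections[field] / self.reviewed


log = FeedbackLog()

# First receipt: the user fixes a misread total.
changed_first = log.record(
    {"vendor_name": "Sharma Stores", "total": 125.0},
    {"vendor_name": "Sharma Stores", "total": 1250.5},
)

# Second receipt: everything was extracted correctly.
changed_second = log.record(
    {"vendor_name": "Gupta Traders", "total": 480.0},
    {"vendor_name": "Gupta Traders", "total": 480.0},
)
```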
By using Gemini's multimodal capabilities through a purpose-built wrapper, we solved a complex engineering problem efficiently and turned our unreliable OCR into a core, trustworthy feature of the app.
Tracks Applied (1)
$300(Open): Cash Prize
Technologies used