DataMask
Human-AI Anonymization for Trust
Created on 8th November 2024
•
DataMask
Human-AI Anonymization for Trust
The problem DataMask solves
DataMask Pro is an advanced data masking and obfuscation tool designed to protect Personally Identifiable Information (PII) within diverse data formats while maintaining the usability and analytical value of the data. The tool serves industries such as healthcare, finance, legal, e-commerce, and AI/ML research, ensuring that sensitive data can be safely used in testing, analysis, and model training workflows without privacy risks. It is built to comply with stringent data privacy laws, including GDPR, HIPAA, and CCPA.
This unified tool provides flexible data protection options, supporting redaction, obfuscation, and context-aware anonymization across file types, with a seamless pipeline for integration into data processing and machine learning environments.
Challenges I ran into
One of the major challenges we faced was balancing privacy and data usability. We wanted to ensure that after masking PII, the remaining data would still be useful for analysis. Creating a tool that could handle diverse data formats—while maintaining accuracy—was another hurdle. Ensuring real-time processing without compromising performance and integrating user-defined PII detection with an SLM added additional layers of complexity.
Tracks Applied (3)
Best Use of MongoDB Atlas
Major League Hacking
Best use of GitHub
GitHub Education
Best Project Built Using Gemini API
Google For Developers

