Soumalya Banik

@SoumalyaBanik

Data Analysis
Artificial Intelligence
Deep Learning
GitHub
Web Application Development

Kolkata, India

I’m Soumalya Banik, a Computer Science and Engineering student specializing in Artificial Intelligence and Machine Learning, driven by the challenge of building AI systems that don’t just work in theory but perform reliably in real-world, high-stakes environments.

My work sits at the intersection of AI for healthcare, behavioral analysis, and computer vision, with a focus on robust, multi-modal detection pipelines that integrate video, audio, and sensor data.

Flagship Project – Autism Behavior Detection Toolkit
Designed and implemented a multi-modal AI pipeline integrating YOLOv8, MediaPipe Holistic, and OpenAI Whisper to detect and classify 9+ clinically significant autism-related behaviors (e.g., eye contact avoidance, self-hitting, atypical language patterns) from short video/audio clips.
• Achieved 92.8% accuracy across multiple behavioral categories on lab and real-world datasets.
• Developed a modular, extensible architecture for integrating new behavioral models without retraining the entire system.
• Produced an IEEE-format research paper in collaboration with researchers at IIT Bhubaneswar.
• Designed for practical deployment in clinical, research, and assistive tech contexts, from autism screening tools to caregiver alert systems.

Other Notable Work
• Oncology Diagnostics AI – Brain tumor classification from MRI scans using DenseNet121 with four-class output (Glioma, Meningioma, Pituitary, No Tumor) and advanced preprocessing pipelines.
• Women’s Safety Platform – GPS-triggered distress alert system with geo-lock encrypted reporting, designed for high-speed, secure incident response.
• Face Recognition Door Lock – Raspberry Pi-based system for secure facial authentication with local recognition and real-time access control.
• AI-based Resume Builder – ATS-optimized resume generator powered by GPT models, producing custom professional resumes in seconds.

Core Skills
• AI/ML: Deep Learning, Computer Vision, NLP, Multi-Modal AI, Model Optimization.
• Frameworks/Tools: YOLOv8, MediaPipe, OpenCV, TensorFlow, PyTorch, OpenAI Whisper.
• Software Development: Full-stack web apps (React, Next.js, Tailwind, Django, Angular), scalable backend APIs.
• Hardware Integration: Raspberry Pi, Arduino, IoT Systems, Edge AI Deployment.
• Research & Analytics: IEEE-style research writing, academic presentations, performance benchmarking.

Links
LinkedIn: https://www.linkedin.com/in/soumalya-banik/
GitHub: https://github.com/SoumalyaBanik