Disease Prediction System(DPS)
Prevention Starts with Prediction
Created on 7th September 2025
โข
Disease Prediction System(DPS)
Prevention Starts with Prediction
The problem Disease Prediction System(DPS) solves
The Disease Prediction System can be used to:
๐งช Early Diabetes Screening โ Quickly check whether a person is at risk of diabetes using basic health details.
โก Fast Health Assessment โ Enter simple inputs like glucose, BMI, and age to get an instant prediction.
๐ฅ Support for Doctors & Clinics โ Acts as a decision support tool for doctors, helping them screen patients faster before recommending detailed tests.
๐จโ๐ฉโ๐ง Self-Check for Individuals โ Anyone can use it from home to get a preliminary idea of their health condition.
๐ Accessible Anywhere โ Runs online on Streamlit Cloud, so no installation or setup is required.
๐ How It Makes Tasks Easier & Safer
โ Saves Time โ No need to wait for multiple medical tests just for a first-level check.
โ Increases Awareness โ Encourages people to monitor their health regularly.
โ Assists Healthcare โ Doctors can quickly identify high-risk patients.
โ Accessible & Safe โ Works on any device with internet, making health predictions more inclusive.
Challenges I ran into
While building this project, one major hurdle we faced was:
โก Mismatch in Features During Prediction
Initially, when we tried to predict the output for a new patient, the app crashed with an error:
ValueError: X has 7 features, but StandardScaler is expecting 8 features as input
This happened because the model was trained on 8 features (Pregnancies, Glucose, Blood Pressure, Skin Thickness, Insulin, BMI, Diabetes Pedigree Function, Age), but during prediction we accidentally passed fewer inputs.
โ How We Solved It
We carefully checked the dataset columns and matched them with the form inputs.
Added all 8 inputs in the form (Pregnancies, Glucose, BP, Skin Thickness, Insulin, BMI, DPF, Age).
After fixing this mismatch, the model worked correctly and predictions were generated smoothly.
๐ฏ Learning
Always ensure that the input format for prediction matches exactly with the training data.
Debugging step by step helped us identify where the mismatch occurred.
This taught us the importance of data consistency in ML pipelines.