Skip to content

๐Ÿ“Š Data Analytics & Machine Learning Portfolio

Hi, I'm Harsvardhan Rajgarhia ๐Ÿ‘‹
3rd year B.Tech CSBS student | Aspiring Data Analyst & ML Engineer

This repository contains my end-to-end Data Analytics, Business Intelligence, and Machine Learning projects using Python, SQL, Power BI, Scikit-learn, Streamlit, Excel, and more.


โšก Tech Stack

Programming & Analytics

Python
Pandas
NumPy
Scikit-learn
Matplotlib
Seaborn
Plotly

Business Intelligence & Visualization

Power BI
Excel
Streamlit

Data Engineering & Storage

SQL
DuckDB


โ“ About Repository

This repository showcases my analytics and ML workflows, covering:

  • ๐Ÿ”„ Data Cleaning & Preprocessing
  • ๐Ÿ“Š Exploratory Data Analysis (EDA)
  • ๐Ÿ“ˆ Interactive Dashboards (Power BI / Streamlit)
  • ๐Ÿค– Machine Learning Models (Classification, Prediction & Insights)
  • ๐Ÿ’ก Business Recommendations
  • ๐Ÿ“˜ Documentation + Setup Guides for Each Project

Each project folder contains:

  • ๐Ÿ“‚ Raw & cleaned datasets
  • ๐Ÿ“˜ Documentation PDF
  • ๐Ÿ“ˆ Dashboard files
  • ๐Ÿงช Jupyter notebooks
  • ๐Ÿงน requirements.txt
  • ๐Ÿ“Š Screenshots

๐Ÿš€ Projects


1. ๐Ÿ“ก Customer Churn Analysis (Power BI + Python + SQL + ML)

End-to-end analytics & machine learning project

  • Tools: Python (Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn), SQL (DuckDB), Power BI
  • Dataset: Telco Customer Dataset (~7,043 records)

๐Ÿ” Key Insights

  • Overall churn: 26.5%
  • Month-to-month contracts โ†’ churn ~42%
  • Fiber Optic users โ†’ churn ~41%
  • Electronic Check โ†’ highest churn among payment methods
  • New customers (<6 months) โ†’ highest churn risk

๐Ÿค– Machine Learning Model

  • Logistic Regression & Random Forest
  • Accuracy: ~83%
  • ROC-AUC: ~0.87
  • Exported churn probabilities + risk buckets for Power BI integration

2. ๐Ÿ‘ฅ Employee Attrition Analysis (Python & Streamlit)

  • Tools: Python (Pandas, Matplotlib, Seaborn, Plotly), Streamlit
  • Dataset: IBM HR Analytics (~1,470 employees, 35+ features)

๐Ÿ” Key Insights

  • Overall attrition: ~16%
  • Sales department โ†’ highest attrition (~20%)
  • Overtime employees โ†’ 3x more likely to leave
  • Low-income group (2.5kโ€“5.5k/month) โ†’ higher attrition
  • Younger age group (25โ€“35) โ†’ higher attrition

3. ๐Ÿ’น Sales & Profit Analysis Dashboard (Power BI)

  • Tools: Power BI, DAX, Power Query
  • Dataset: Simulated 20K+ sales transactions

๐Ÿ” Key Insights

  • Profit Margin ~29%
  • South region โ†’ top revenue (~โ‚น700K)
  • Product P010 โ†’ best performer
  • Bangalore Hub โ†’ highest sales

๐ŸŒŸ Future Work

  • Predictive Sales Forecasting
  • Time-Series Analysis
  • Deep Learning Projects
  • Portfolio Website

๐Ÿ‘ค Connect With Me

๐Ÿ“ง Email: [email protected]
๐Ÿ’ผ LinkedIn: **https://www.linkedin.com/in/harshvardhan-rajgarhia-ba6