A collection of Machine Learning projects built using Python and industry-standard ML libraries. This repository showcases end-to-end machine learning workflows, including data preprocessing, exploratory data analysis (EDA), feature engineering, model development, evaluation, and business insights.
Hi, I'm Harsh Poonia.
🎓 MCA Student at Thapar Institute of Engineering & Technology
I am passionate about Machine Learning, Data Analytics, and Software Development. This repository documents my journey of building real-world machine learning projects while strengthening my understanding of data-driven problem solving.
- Python
- Pandas
- NumPy
- Matplotlib
- Seaborn
- Scikit-Learn
- Jupyter Notebook
- VS Code
- Git
- GitHub
Projects in this repository typically follow:
- Data Collection
- Data Cleaning
- Exploratory Data Analysis (EDA)
- Feature Engineering
- Data Preprocessing
- Model Training
- Model Evaluation
- Performance Comparison
- Business Insights
- Model Persistence
| No. | Project | ML Type | Status |
|---|---|---|---|
| 1 | Customer Churn Analysis & Prediction | Classification | ✅ Completed |
| 2 | House Price Prediction | Regression | 🔄 Planned |
| 3 | Loan Approval Prediction | Classification | 🔄 Planned |
| 4 | Credit Risk Prediction | Classification | 🔄 Planned |
| 5 | Customer Segmentation | Clustering | 🔄 Planned |
| 6 | Recommendation System | Recommendation | 🔄 Planned |
| 7 | Email Spam Detection | NLP | 🔄 Planned |
| 8 | Movie Review Sentiment Analysis | NLP | 🔄 Planned |
| 9 | Sales Forecasting | Time Series | 🔄 Planned |
| 10 | Medical Diagnosis Prediction | Classification | 🔄 Planned |
Predict whether a telecom customer is likely to churn based on demographic information, account details, and service usage patterns.
- ✅ Data Cleaning & Preprocessing
- ✅ Exploratory Data Analysis (EDA)
- ✅ Feature Engineering
- ✅ Model Training
- ✅ Model Evaluation
- ✅ Model Comparison
- ✅ Feature Importance Analysis
- ✅ Model Saving
| Model | Accuracy | Precision | Recall | F1 Score |
|---|---|---|---|---|
| Logistic Regression | 0.807 | 0.647 | 0.607 | 0.626 |
| Decision Tree | 0.743 | 0.517 | 0.521 | 0.519 |
| Random Forest | 0.796 | 0.642 | 0.527 | 0.579 |
🏆 Logistic Regression
- Accuracy: 80.74%
- Precision: 64.67%
- Recall: 60.70%
- F1 Score: 62.62%
- Customers with shorter tenure are more likely to churn.
- Higher monthly charges increase churn risk.
- Long-term contracts improve customer retention.
- Fiber optic customers exhibit higher churn rates.
- Electronic check users are more likely to churn.
- Build 10+ Machine Learning Projects
- Strengthen Statistical Thinking
- Master Scikit-Learn
- Learn Model Optimization Techniques
- Explore NLP and Deep Learning
- Develop Industry-Ready ML Workflows
- Build a Placement-Ready Portfolio
- Data Cleaning
- Data Preprocessing
- Exploratory Data Analysis
- Feature Engineering
- Data Visualization
- Classification Modeling
- Regression Modeling
- Clustering
- Model Evaluation
- Feature Scaling
- Machine Learning Pipelines
- Business Insight Generation
Machine-Learning-Projects/
│
├── Customer_Churn_Analysis/
│
├── README.md
│
└── Future Projects...
- ✅ 1 Machine Learning Project Completed
- 🔄 9 Planned Projects Remaining
- 🎯 Building a diverse portfolio covering Classification, Regression, Clustering, NLP, Recommendation Systems, and Time Series Forecasting
https://github.com/Harshpoonia
⭐ This repository is continuously updated as I build and document new Machine Learning projects.