Customer Churn Prediction Model

Overview

This project implements a machine learning solution to predict customer churn using various classification algorithms. The model helps identify customers who are likely to discontinue services, enabling proactive retention strategies.

Features

Data preprocessing and exploratory data analysis
Implementation of multiple machine learning models:
- XGBoost Classifier
- Random Forest Classifier
- Decision Tree Classifier
- Support Vector Machine (SVM)
- K-Nearest Neighbors (KNN)
- Neural Network
Model evaluation and comparison
Interactive web interface using Streamlit
Model persistence using pickle
Handling imbalanced data using SMOTE

Installation

Prerequisites

Python 3.11
Conda (recommended for environment management)

Environment Setup

Create a new conda environment

conda create -n churn-prediction python=3.11

Activate the environment

conda activate churn-prediction

Install required packages

pip install -r requirements.txt

Required Packages

numpy
pandas
scikit-learn
streamlit
xgboost
imbalanced-learn
python-dotenv
matplotlib
seaborn
plotly

Usage

Running the Jupyter Notebook

jupyter notebook churn.ipynb

Running the Streamlit App

streamlit run main.py

Project Structure

customer-churn/ ├── churn.ipynb # Main notebook with model development ├── main.py # Streamlit application ├── churn.csv # Dataset ├── models/ # Saved model files │ ├── dt_model.pkl │ ├── knn_model.pkl │ ├── rf_model.pkl │ ├── svm_model.pkl │ └── xgb_model.pkl ├── requirements.txt # Project dependencies └── README.md # Project documentation

Model Performance

The project implements and compares several machine learning models for churn prediction. Each model is evaluated using metrics such as accuracy, precision, recall, and F1-score.

Environment Variables

Create a .env file in the project root with the following variables:

GROQ_API_KEY=<your-groq-api-key>

Contributing

Fork the repository
Create a new branch
Make your changes
Submit a pull request

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.streamlit		.streamlit
__pycache__		__pycache__
.gitignore		.gitignore
README.md		README.md
churn.csv		churn.csv
churn.ipynb		churn.ipynb
dt_model.pkl		dt_model.pkl
knn_model.pkl		knn_model.pkl
main.py		main.py
nb_model.pkl		nb_model.pkl
requirements.txt		requirements.txt
rf_model.pkl		rf_model.pkl
runtime.txt		runtime.txt
svm_model.pkl		svm_model.pkl
utils.py		utils.py
voting_clf.pkl		voting_clf.pkl
xgb_model.pkl		xgb_model.pkl
xgboost-SMOTE.pkl		xgboost-SMOTE.pkl
xgboost-featureEngineered.pkl		xgboost-featureEngineered.pkl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Customer Churn Prediction Model

Overview

Features

Installation

Prerequisites

Environment Setup

Create a new conda environment

Activate the environment

Install required packages

Required Packages

Usage

Running the Jupyter Notebook

Running the Streamlit App

Project Structure

Model Performance

Environment Variables

Contributing

License

Contact

About

Releases

Packages

Languages

Paul-Clue/customer-churn2

Folders and files

Latest commit

History

Repository files navigation

Customer Churn Prediction Model

Overview

Features

Installation

Prerequisites

Environment Setup

Create a new conda environment

Activate the environment

Install required packages

Required Packages

Usage

Running the Jupyter Notebook

Running the Streamlit App

Project Structure

Model Performance

Environment Variables

Contributing

License

Contact

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages