Fraud Detection Project

Overview

This project is a fraud detection system that leverages machine learning models to identify fraudulent transactions. It includes data preprocessing, synthetic data generation, model training, and a web-based interface for results visualization.

Features

Preprocessing and cleaning raw transaction data
Generating synthetic data to enhance training datasets
Building and training a Random Forest model
Visualizing predictions and insights through a web dashboard

Requirements

To run this project, you need:

Python (>=3.9)
Libraries specified in req.txt

Install Dependencies

bash pip install -r req.txt

Usage

Clone the Repository bash git clone cd CODE/fraud_detection_project
Prepare the Data Place your data files in the data/ folder.
Run the Web Application bash python app.py
Optional: Retrain the Model If you need to retrain the model: bash python src/model/train_model.py

Project Structure

fraud_detection_project/ ├── app.py
├── main.py
├── synthetic_data.py

data/
├── final_dataset.csv ├── new_transaction_data.csv ├── predictions.csv ├── transactions.csv └── transaction_history_unlocked.pdf

src/
├── generate_synData.py
├── pdf_to_csv.py
├── predict.py
├── visualize_predictions.py

src/model/
├── best_random_forest_model.joblib ├── train_model.py
├── evaluate_model.py
├── preprocessed_data.joblib
└── preprocessor.joblib

src/preprocess/
└── preprocess_data.py

static/
├── css/
└── images/

templates/
├── analysis.html ├── dashboard.html ├── index.html └── visualize.html

req.txt

How It Works

Data Preprocessing
The preprocess_data.py script cleans and prepares raw transaction data for training.
Synthetic Data Generation
generate_synData.py creates synthetic datasets to augment training data.
Model Training
train_model.py trains a Random Forest model to detect fraudulent transactions.
Prediction and Visualization
predict.py runs predictions, and the results are displayed in a user-friendly web interface.

Libraries and Tools

This project leverages the following libraries and tools:

Python: Programming language used for implementation.
Scikit-learn: For machine learning model development and evaluation.
Pandas: For data preprocessing and manipulation.
Joblib: For saving and loading model and preprocessor artifacts.
Flask: To build the web application and API endpoints.
Matplotlib/Seaborn: For data visualization and exploratory analysis.

Data Sources

Synthetic Dataset: Created using generate_synData.py to supplement training data.
Raw Transaction Data: Processed using scripts in the preprocess/ and data/ directories.

Acknowledgments

OpenAI: For providing tools and guidance during the project.
Community Contributors: Special thanks to open-source community contributors for shared knowledge and resources.

License

This project is licensed under the MIT License. See the LICENSE file for full details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fraud Detection Project

Overview

Features

Requirements

Install Dependencies

Usage

Project Structure

How It Works

Libraries and Tools

Data Sources

Acknowledgments

License

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
app.py		app.py
classification_report.png		classification_report.png
evaluate_model.py		evaluate_model.py
final_dataset.csv		final_dataset.csv
fraudulent_distribution.png		fraudulent_distribution.png
generate_synData.py		generate_synData.py
main.py		main.py
new_transaction_data.csv		new_transaction_data.csv
pdf_to_csv.py		pdf_to_csv.py
predict.py		predict.py
predictions.csv		predictions.csv
preprocess_data.py		preprocess_data.py
req.txt		req.txt
train_model.py		train_model.py
visualize_predictions.py		visualize_predictions.py

KRIPA184/design-project-1

Folders and files

Latest commit

History

Repository files navigation

Fraud Detection Project

Overview

Features

Requirements

Install Dependencies

Usage

Project Structure

How It Works

Libraries and Tools

Data Sources

Acknowledgments

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages