Resume Matching Tool

This repository matches resumes with job descriptions using natural language processing (NLP) techniques. The goal is to suggest suitable job titles automatically from resume content, improving on the keyword-matching methods common on job search platforms.

Introduction

Job recommender systems are critical tools for job seekers, yet existing methods often rely on simplistic keyword matching. This project applies NLP and machine learning techniques to go beyond keyword matching. The primary objective is a model that analyzes a resume and recommends the most suitable job title based on its content.

Approach

The project encompasses several key components:

Literature Review: Investigates existing methodologies in resume parsing and job matching, highlighting the use of NLP tools like cosine similarity, TF-IDF, and advanced embedding techniques.
Data Collection and Preprocessing:
- Resumes: Retrieved from Kaggle and categorized into technical job titles.
- Job Descriptions: Scraped from Indeed.com for 12 specified job titles.
Exploratory Data Analysis: Examines the distribution and common terms within the resume and job description datasets.
Matching Methodology:
- Data Preprocessing: Includes text normalization, skill extraction using spaCy's Named Entity Recognition (NER), and TF-IDF vectorization.
- Models Tested:
  - Cosine Similarity (Mode and Average)
  - Logistic Regression
  - Random Forest
Evaluation: Measures model performance using accuracy, recall, precision, and F1-score metrics.
Interactive Tool Development:
- Implements a tool for BSE students to upload resumes and receive suggested job titles, along with identified missing skills.
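
The cosine similarity baselines listed under Matching Methodology can be illustrated with a small, dependency-free sketch. It is not the notebook's code. The reading of "Mode" and "Average" used here is an assumption: "average" picks the job title whose descriptions have the highest mean similarity to the resume, and "mode" picks the most frequent title among the k most similar descriptions. The function names, the smoothed IDF formula, and the toy data are all hypothetical.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a stand-in for real text normalization)."""
    return text.lower().split()


def fit_idf(documents):
    """Smoothed inverse document frequency for every term in the corpus."""
    n_docs = len(documents)
    doc_freq = Counter(term for doc in documents for term in set(tokenize(doc)))
    return {term: math.log((1 + n_docs) / (1 + df)) + 1 for term, df in doc_freq.items()}


def tfidf_vector(text, idf):
    """Sparse TF-IDF vector as a dict; terms unseen during fitting are dropped."""
    counts = Counter(tokenize(text))
    return {term: count * idf[term] for term, count in counts.items() if term in idf}


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def match_by_average(resume_vec, job_vecs, job_titles):
    """Assumed 'Average' variant: title with the highest mean similarity."""
    sims = defaultdict(list)
    for vec, title in zip(job_vecs, job_titles):
        sims[title].append(cosine(resume_vec, vec))
    return max(sims, key=lambda title: sum(sims[title]) / len(sims[title]))


def match_by_mode(resume_vec, job_vecs, job_titles, k=3):
    """Assumed 'Mode' variant: most frequent title among the k nearest descriptions."""
    ranked = sorted(
        zip(job_vecs, job_titles),
        key=lambda pair: cosine(resume_vec, pair[0]),
        reverse=True,
    )
    top_titles = [title for _, title in ranked[:k]]
    return Counter(top_titles).most_common(1)[0][0]


# Toy data, invented for illustration only.
job_descriptions = [
    "build machine learning models in python and sql",
    "train deep learning models and analyse data with python",
    "design responsive web pages with javascript html and css",
    "develop frontend components in javascript and react",
]
job_titles = ["Data Scientist", "Data Scientist", "Web Developer", "Web Developer"]

idf = fit_idf(job_descriptions)
job_vecs = [tfidf_vector(doc, idf) for doc in job_descriptions]
resume_vec = tfidf_vector("python developer experienced in machine learning and sql", idf)

print(match_by_average(resume_vec, job_vecs, job_titles))
print(match_by_mode(resume_vec, job_vecs, job_titles, k=3))
```

In practice a library vectorizer would likely replace the hand-written TF-IDF; the sketch only shows how a single resume vector is compared against many labelled job descriptions.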

Results and Conclusion

The project finds that supervised models such as logistic regression and random forest improve resume matching accuracy over the cosine similarity baselines. The findings support the use of NLP techniques to automate the job matching process.
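
To make the evaluation metrics concrete, here is a minimal sketch of how accuracy, precision, recall, and F1-score can be computed for a multi-class job title predictor. Macro averaging (an unweighted mean over job titles) is an assumption; the averaging scheme used in the project is not stated here. The labels are toy values invented for illustration and are not the project's results.

```python
from collections import Counter


def classification_metrics(y_true, y_pred):
    """Accuracy plus macro-averaged precision, recall, and F1."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted label gets a false positive
            fn[truth] += 1  # true label gets a false negative

    def safe_div(num, den):
        return num / den if den else 0.0

    precisions, recalls, f1s = [], [], []
    for label in labels:
        p = safe_div(tp[label], tp[label] + fp[label])
        r = safe_div(tp[label], tp[label] + fn[label])
        precisions.append(p)
        recalls.append(r)
        f1s.append(safe_div(2 * p * r, p + r))

    n = len(labels)
    return {
        "accuracy": safe_div(sum(tp.values()), len(y_true)),
        "precision": sum(precisions) / n,
        "recall": sum(recalls) / n,
        "f1": sum(f1s) / n,
    }


# Toy labels, invented for illustration; these are not the project's results.
y_true = ["Data Scientist", "Data Scientist", "Web Developer", "Web Developer"]
y_pred = ["Data Scientist", "Web Developer", "Web Developer", "Web Developer"]
metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

Running the same function on the predictions of each model (cosine similarity, logistic regression, random forest) over a shared test set gives directly comparable scores.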

Repository Structure

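data/: Contains datasets used for training and evaluation.
Notebook-JobCvMatcher.ipynb: Jupyter notebook detailing data preprocessing, model training, and evaluation.
Scraping_indeed.ipynb: Jupyter notebook for scraping job descriptions from Indeed.com.
src/: Source code for the library of utility functions.
Report-JobCvMatcher.pdf: The complete project report.
README.md: Overview of the project, methodology, and key findings.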

Getting Started

To explore the project:

Clone the repository.
Open the Notebook-JobCvMatcher.ipynb notebook and follow the detailed steps.
Run the interactive tool at the end of the notebook to match resumes with job titles.
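
The interactive tool also reports missing skills. One plausible way to think about that output is as a set difference between the skills found in the resume and the skills that recur across job descriptions for the suggested title. The sketch below assumes skills have already been extracted (for example with spaCy NER, as described under Approach). The function name, the `min_share` threshold, and the sample data are hypothetical and do not come from the notebook.

```python
from collections import Counter


def missing_skills(resume_skills, job_skill_lists, min_share=0.5):
    """Skills present in at least `min_share` of the job descriptions
    for a title but absent from the resume (case-insensitive)."""
    resume = {skill.lower() for skill in resume_skills}
    counts = Counter()
    for skills in job_skill_lists:
        # Count each skill once per job description.
        counts.update({skill.lower() for skill in skills})
    threshold = min_share * len(job_skill_lists)
    common = {skill for skill, n in counts.items() if n >= threshold}
    return sorted(common - resume)


# Toy data, invented for illustration only.
resume_skills = ["Python", "SQL"]
job_skill_lists = [
    ["python", "sql", "spark"],
    ["python", "spark", "docker"],
    ["python", "sql", "tableau"],
]
print(missing_skills(resume_skills, job_skill_lists))  # ['spark']
```

Raising `min_share` narrows the suggestions to skills that nearly every posting asks for, while lowering it surfaces rarer skills as well.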

For further details and usage instructions, refer to the complete report and project documentation.

This repository serves as a comprehensive exploration of NLP techniques for resume matching, offering insights into effective methodologies for job recommendation systems. The interactive tool developed as part of this project provides practical utility for BSE students seeking tailored job suggestions based on their resumes.

adgianv/NLP-ReccommenderSystem-Job_CV_Matching
