Personality and Drug Consumption

Project Overview

Motivation
Research Questions
Methodology
Conclusions
Future directions
Presentation of the project

Getting started

Repository structure
Dependencies
How to run

Motivation

This project was motivated by the fact that drugs take a lot of lives. According to the CDC (2023) “In 2021, 106,699 drug overdose deaths occurred in the United States”. The main goal was to understand how different traits of personality interact with each other in determining the risk of drug consumption. Therefore, better prevention measures could be taken.

Research Questions

The two main research questions were:

Can we predict drug consumption based on personality?
Which dimensions of personality/combination of dimensions have more influence in determining drug consumption?

Methodology

To achieve the goal and answer the research questions, I utilized supervised machine learning algorithms for binary classification. I first conducted a detailed exploratory data analysis to get an understanding of the dataset's characteristics. I then employed various methodologies, including a decision tree classifier, a random forest classifier, logistic regression, and a deep learning model. To ensure optimal performance, I incorporated hyper-parameter fine-tuning techniques like grid search and cross-validation. Finally, I performed a thorough model evaluation by interpreting different metrics of the testing results, such as confusion matrix and recall scores.

Conclusions

Although none of the models proved to be sufficiently predictive, this experience allowed me to gain knowledge about the Machine Learning workflow and provided me with the necessary tools for future projects.
Research Question 1 - Personality by itself may not be enough to predict drug consumption.
Research Question 2 - No dimension of personality or combination of dimensions were found to have an important influence.
Other factors, such as demographics and life conditions should be taken into consideration.

Future directions

Try collecting more data (with more variables, diverse demographics, and samples).
Try using ensemble methods.

Presentation of the project

Repository Structure

Drug_Consumption.csv: personality and drug consumption data obtained from Kaggle
DrugsPersonality.ipynb: jupyter notebook containing exploratory data analysis, model exploration and evaluation
README.md: this file

Dependencies

python 3.11.4
numpy 1.23.5
pandas 1.5.3
graphviz 2.50.0
matplotlib 3.7.1
seaborn 0.12.2
keras 2.12.0
tensorflow 2.12.0

How to run

Clone the repository in a local machine
Use an integrated development environment such as VS code.
Make sure to have all the necessary dependendencies installed
Run

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Personality and Drug Consumption

Motivation

Research Questions

Methodology

Conclusions

Future directions

Presentation of the project

Repository Structure

Dependencies

How to run

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
Drug_Consumption.csv		Drug_Consumption.csv
DrugsPersonality.ipynb		DrugsPersonality.ipynb
README.md		README.md

natashacarpcast/drugsandpersonality

Folders and files

Latest commit

History

Repository files navigation

Personality and Drug Consumption

Motivation

Research Questions

Methodology

Conclusions

Future directions

Presentation of the project

Repository Structure

Dependencies

How to run

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages