Overconfidence and Financial Behaviors

I investigate whether overconfidence in financial literacy affects households' financial behaviors through the lens of retirement readiness, precautionary savings, and financial market participation. Machine learning classifiers are employed to measure overconfidence.

The repository consists of four parts, namely codes, data, outputs, and docs.

1. Codes

The code is divided into three parts.

(a) Clean the NFCS dataset

The code is in the Stata do file process_NFCS.do, and its log file is in the outputs/logs folder. I extract the necessary variables (financial behaviors, demographic characteristics, and perceived and true financial literacy) from the raw data, merge in the education data from the 2012 study, and perform some basic cleaning. The cleaned data are exported to processed_NFCS.csv in the data folder.
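
The actual cleaning is done in Stata, but the extract-merge-export flow can be sketched in a few lines of Python for readers who do not use Stata. This is only an illustration under stated assumptions: the column names (respondent_id, perceived_literacy, quiz_correct, education), the merge key, and the sample rows are all hypothetical and are not taken from the NFCS files or from process_NFCS.do.

```python
import csv
import io

# Hypothetical stand-ins for the raw survey file and the 2012 education data.
# Column names and values are invented for illustration only.
raw_survey = io.StringIO(
    "respondent_id,perceived_literacy,quiz_correct,unused_column\n"
    "1,6,2,x\n"
    "2,3,5,y\n"
    "3,5,4,z\n"
)
education_2012 = io.StringIO(
    "respondent_id,education\n"
    "1,college\n"
    "3,high school\n"
)

# Build a lookup table from the education file, keyed on the assumed ID column.
education = {row["respondent_id"]: row["education"]
             for row in csv.DictReader(education_2012)}

# Keep only the variables needed downstream and attach education (left join).
keep = ["respondent_id", "perceived_literacy", "quiz_correct"]
cleaned = []
for row in csv.DictReader(raw_survey):
    record = {name: row[name] for name in keep}
    record["education"] = education.get(row["respondent_id"], "")  # empty if no match
    cleaned.append(record)

# Export the cleaned rows as CSV text (a real run would write to a file instead).
out = io.StringIO()
writer = csv.DictWriter(out, fieldnames=keep + ["education"], lineterminator="\n")
writer.writeheader()
writer.writerows(cleaned)
print(out.getvalue())
```

Respondents without a matching education record are kept with an empty value here; whether the do file keeps or drops such rows is not stated above.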

(b) Construct the overconfidence measures using machine learning classifiers

The code is in the Jupyter notebook overconfidence_measure.ipynb. After reading the pre-cleaned data, I first examine the basic patterns of perceived and true financial literacy to form an initial impression of the data. I then construct a learning set in which households can be unambiguously categorized as overconfident or not overconfident. Next, I train the classifiers on the learning set and compare their performance by mean squared error (MSE). The MSEs of the different classifiers are exported as MSE.png to the outputs/figures folder and as MSE.csv to the outputs/tables folder. Finally, I generate the out-of-sample predictions and export the dataset as overconfidence_measure.csv to the outputs/tables folder for later use. Several other figures are also exported to the outputs/figures folder.
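
To make the learning-set and MSE-comparison steps concrete, here is a minimal standard-library sketch. It is not the notebook's code: the labeling thresholds, the score scales, the sample households, and the two sets of predicted probabilities (classifier_a, classifier_b) are all assumptions chosen for illustration, and no classifier is actually trained here.

```python
from statistics import mean

# Hypothetical households: (perceived literacy score, number of quiz questions correct).
households = [(7, 1), (6, 2), (2, 5), (3, 6), (5, 4), (4, 3), (7, 6), (1, 0)]

def label(perceived, correct):
    """Return 1 (overconfident), 0 (not overconfident), or None (ambiguous).

    The cutoffs are illustrative assumptions, not the rule used in the paper.
    """
    if perceived >= 6 and correct <= 2:
        return 1  # high self-assessment, low measured literacy
    if perceived <= 3 and correct >= 4:
        return 0  # low self-assessment, high measured literacy
    return None   # everything else is left for out-of-sample prediction

# The learning set keeps only the unambiguous cases.
learning_set = [(h, label(*h)) for h in households if label(*h) is not None]
y_true = [y for _, y in learning_set]

def mse(actual, predicted):
    """Mean squared error between 0/1 labels and predicted probabilities."""
    return mean((a - p) ** 2 for a, p in zip(actual, predicted))

# Hypothetical predicted probabilities from two classifiers on the learning set.
predictions = {
    "classifier_a": [0.9, 0.8, 0.2, 0.1],
    "classifier_b": [0.6, 0.7, 0.4, 0.5],
}
scores = {name: mse(y_true, p) for name, p in predictions.items()}
best = min(scores, key=scores.get)  # lower MSE is better
print(scores, best)
```

In a real comparison the predictions would come from fitted classifiers and would normally be scored on held-out data rather than on the rows used for training.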

(c) Explore the effects of overconfidence

The code is in the Stata do file Analysis.do, and its log file is in the outputs/logs folder. Because precision is lost when importing .csv files into Stata, I first convert overconfidence_measure.csv to an Excel file and place it in the data folder before reading it into Stata. I then run the logit regression model (3) presented in the paper to test whether overconfidence affects households' retirement readiness, precautionary savings, and financial market participation. Heterogeneous effects are also investigated. Multiple tables are exported to the outputs/tables folder.

2. Data

The data of my study come from the National Financial Capability Studies (NFCS), which can be downloaded here. To be specific, I utilized the 2018 State-by-State Survey — Tracking Dataset, Comma delimited Excel file (.csv) and the 2012 State-by-State Survey — Respondent-Level Data, Comma delimited Excel file (.csv) to complete my analyses. They are also uploaded to this folder.

3. Docs

This folder contains the poster and the paper for this project.

WMhYang/Overconfidence-Financial-Behaviors

Folders and files

Latest commit

History

Repository files navigation

Overconfidence and Financial Behaviors

1. Codes

(a) Clean the NFCS dataset

(b) Construct the overconfidence measures using machine learning classifiers

(c) Explore the effects of overconfidence

2. Data

3. Docs

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages