This repository helps chess players use and work with the public chess profile data provided by FIDE (through December 2019). FIDE's website can be cumbersome to work with and offers limited visuals about chess players. My hope is that this project spares users the pain of using that website and provides interesting insights to the chess community.
The project is divided into 6 steps, which are outlined here.
I've also created a dashboard where chess players can see ratings in real time here.
The data is located in this folder.
The folder contains over a hundred .csv files, covering January 2001 to December 2019, that can be analyzed.
If you have suggestions on different data formats you'd like, please state them in Issues.
Other than that, feel free to use it as you see fit.
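As a starting point for analysis, here is a minimal sketch of reading every .csv file in a folder with Python's standard library. The function name `load_rating_files` and the folder path in the usage note are hypothetical, and the sketch assumes each file starts with a header row.

```python
import csv
from pathlib import Path


def load_rating_files(folder):
    """Read every .csv file in `folder` and return {file stem: list of row dicts}."""
    tables = {}
    for path in sorted(Path(folder).glob("*.csv")):
        # utf-8-sig tolerates an optional byte-order mark at the start of a file.
        with path.open(newline="", encoding="utf-8-sig") as handle:
            # Assumption: the first line of each file holds the column names.
            tables[path.stem] = list(csv.DictReader(handle))
    return tables
```

Calling `load_rating_files("path/to/data")` (substitute the actual folder) gives one list of rows per monthly file, keyed by file name without the extension. Every value is read as a string, so numeric columns still need converting before analysis.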
An example of what the FIDE Standard Rating data looks like in December 2019:
ID_NUMBER | Name | Fed | Sex | Tit | WTit | OTit | FOA | Rating | Gms | K | Birthday | Flag |
---|---|---|---|---|---|---|---|---|---|---|---|---|
25121731 | A C J John | IND | M | | | | | 1063 | 0 | 40 | 1987 | |
35077023 | A Chakravarthy | IND | M | | | | | 1151 | 0 | 40 | 1986 | i |
10207538 | A E M, Doshtagir | BAN | M | | | | | 1840 | 0 | 40 | 1974 | i |
10680810 | A hamed Ashraf, Abdallah | EGY | M | | | | | 1728 | 0 | 40 | 2001 | |
5716365 | A Hamid, Harman | MAS | M | | | NI | | 1325 | 0 | 40 | 1970 | i |
From the FIDE Download Rating List page (old), we can understand each column a bit more:
Column name | Meaning | Example |
---|---|---|
ID_NUMBER | a FIDE player's ID | 123456 |
Name | a FIDE player's name | Carlsen, Magnus |
Fed | a FIDE player's federation | USA |
Sex | a FIDE player's sex | M, F |
Tit | a FIDE player's title | GM, IM, FM, etc. |
WTit | a FIDE player's women's title | WGM, WIM, WFM, etc. |
OTit | a FIDE player's other title(s)** | IA, FT, NI, etc. |
FOA | a FIDE player's FOA*** titles | AGM, AIM, AFM, etc. |
Rating | a FIDE player's rating | 2168 |
Gms | # of games played in a month | 46 |
K | a FIDE player's K-factor | 40 |
Birthday | a FIDE player's birth year | 1993 |
Flag | a FIDE player's level of activity | i, wi |
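To make the column meanings concrete, the sketch below parses a few rows from the December 2019 example, converts the numeric columns to integers, and keeps only active players. It rests on several assumptions: the .csv header matches the column names in the table, missing titles are stored as blank cells, and a Flag of `i` or `wi` marks an inactive player while a blank Flag means active. The helper names `parse_rows` and `is_inactive` are hypothetical.

```python
import csv
import io

# Three rows from the December 2019 example, written as CSV text.
# Assumption: empty title cells are blank, and names containing commas are quoted.
SAMPLE = """ID_NUMBER,Name,Fed,Sex,Tit,WTit,OTit,FOA,Rating,Gms,K,Birthday,Flag
25121731,A C J John,IND,M,,,,,1063,0,40,1987,
35077023,A Chakravarthy,IND,M,,,,,1151,0,40,1986,i
5716365,"A Hamid, Harman",MAS,M,,,NI,,1325,0,40,1970,i
"""

INT_COLUMNS = ("ID_NUMBER", "Rating", "Gms", "K", "Birthday")


def parse_rows(text):
    """Turn CSV text into dicts, converting numeric columns to int (None if blank)."""
    rows = []
    for raw in csv.DictReader(io.StringIO(text)):
        row = dict(raw)
        for column in INT_COLUMNS:
            value = (row.get(column) or "").strip()
            row[column] = int(value) if value else None
        rows.append(row)
    return rows


def is_inactive(row):
    # Assumption: "i" or "wi" in Flag means inactive; a blank Flag means active.
    return (row.get("Flag") or "").strip() in {"i", "wi"}


rows = parse_rows(SAMPLE)
active_names = [row["Name"] for row in rows if not is_inactive(row)]
print(active_names)
```

Reading through `csv.DictReader` rather than splitting on commas keeps names such as "A Hamid, Harman" in a single field.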
The lifecycle of the data is divided into 6 steps (below).
All 6 steps are done through R and Python.
You can click each step below for more information.
Step #1 | Download the data |
Step #2 | Reformatting the data |
Step #3 | Scraping country data |
Step #4 | Cleaning the data |
Step #5 | Visualizing the data |
Step #6 | Future Work |
I chose to work on this project for several reasons:
- FIDE's publicly available data is poorly organized. There is no "download all datasets" button to acquire all of the data. FIDE also doesn't publicly list data prior to February 2015 on the new download page, even though data goes back as early as 2001. As a result, site visitors may find FIDE's website frustrating to work with, and I wanted to help them overcome that struggle.
- Chess players like to see visuals of themselves, friends, competitors, top players, and players across various demographics. I wanted to provide these visuals.
- I wanted to improve my skills in 2 programming languages (R and Python).
- I've always taken an interest in any data about chess that has not been extensively analyzed. My curiosity drives me.
You can clone this repo with the following:
$ git clone https://github.com/AnujDahiya24/FIDE-Chess-Data
Please post inquiries about the data in Issues.