Data Preparation Script for Custom Face Dataset that are collected from Google Image for InsightFace Project

Preparation steps:

Install required package if you are not using GPU

pip install mxnet

or if you are using GPU, please install mxnet-cu92 instead

pip install mxnet-cu92

Install Selenium

pip install selenium

Install Webdriver

pip install webdriver

Install insightface

pip install insightface

Before running, you need to list down the name of famous person that you would like to search from google in a file named face_name_list.dat

Running the scripts

Run the data preparation script in the following order:

01_collect_faces.py, by default will generate all raw faces into ./downloads directory. You need to make sure that the first file is the anchor image for face comparison to select valid faces in the next steps
02_prepare_faces.py, by default will validate faces and move the valid faces to ./faces directory
03_split_data_sets.py, by default will generate datasets into train, verification, and test (80:10:10), represented by train.part, ilfw.part, and ilfw-test.part file in ./faces directory
04_generate_train.py, by default will generate a train.lst file in ./faces directory, generate train.rec, train.idx, and property file in ./ilfw directory
05_generate_validation.py, by default will generate pairs.txt file in ./faces directory, and generate .bin file in ./ilfw directory

Notes: ILFW is short for Indonesian Labelled Face in the Wild

When properly run, the dataset will create:

train.idx
train.rec
property
ilfw.bin
ilfw-test.bin

Before training:

Copy your dataset to folder datasets, and assign your dataset variable for the training as follow:

dataset.emore = edict()
dataset.emore.dataset = 'emore'
dataset.emore.dataset_path = '../datasets/ilfw'
dataset.emore.num_classes = <the number of identities>
dataset.emore.image_shape = (112,112,3)
dataset.emore.val_targets = ['ilfw', 'ilfw-test']

TODO

More detailed explanation

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
images		images
modules		modules
.env		.env
.gitignore		.gitignore
1A_collect_faces.py		1A_collect_faces.py
1A_collect_faces.py (1).sh		1A_collect_faces.py (1).sh
1A_collect_faces.py.sh		1A_collect_faces.py.sh
1B_check_duplicate_face.py		1B_check_duplicate_face.py
2A_prepare_faces.py		2A_prepare_faces.py
2B_check_completeness.py		2B_check_completeness.py
2C_add_face_masks.py		2C_add_face_masks.py
3_split_data_sets.py		3_split_data_sets.py
4_generate_train.py		4_generate_train.py
5_generate_validation.py		5_generate_validation.py
98_check_small_counts.py		98_check_small_counts.py
99_fix_inconsistent_naming.py		99_fix_inconsistent_naming.py
README.md		README.md
face_name_list.dat		face_name_list.dat
inference-failed-matching.py		inference-failed-matching.py
inference-test (1).py		inference-test (1).py
inference-test.py		inference-test.py
settings (1).json		settings (1).json
settings.py		settings.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data Preparation Script for Custom Face Dataset that are collected from Google Image for InsightFace Project

Preparation steps:

Running the scripts

Before training:

TODO

About

Releases

Packages

Languages

wwidjaya/insightface-ilfw-data-preparation

Folders and files

Latest commit

History

Repository files navigation

Data Preparation Script for Custom Face Dataset that are collected from Google Image for InsightFace Project

Preparation steps:

Running the scripts

Before training:

TODO

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages