Name (ID): 103011105 賴怡惠

# Project 4 / Face Detection with a Sliding Window

## Overview

This project trains a linear SVM classifier on HoG features to distinguish faces from non-faces, then runs it as a sliding-window detector over test images at multiple scales.
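
As a rough orientation before the per-file details below, this sketch shows how the four scripts could be wired together. It is hypothetical glue code, not the actual proj4.py: the function signatures, the placeholder paths, num_negatives and the feature_params defaults are assumptions made for illustration.

    import numpy as np
    from get_positive_features import get_positive_features
    from get_random_negative_features import get_random_negative_features
    from svm_classify import svm_classify
    from run_detector import run_detector

    # assumed defaults: a 36-pixel face template split into 6-pixel HoG cells
    feature_params = {'template_size': 36, 'hog_cell_size': 6}

    # placeholder paths; the real proj4.py points at the project data folders
    train_path_pos = 'path/to/face_crops'
    non_face_scn_path = 'path/to/non_face_scenes'
    test_scn_path = 'path/to/test_scenes'
    num_negatives = 10000

    # HoG features of the face crops and of random non-face patches
    features_pos = get_positive_features(train_path_pos, feature_params)
    features_neg = get_random_negative_features(non_face_scn_path, feature_params, num_negatives)

    # stack features and labels (+1 face, -1 non-face) and train the linear SVM
    x = np.vstack([features_pos, features_neg])
    y = np.hstack([np.ones(len(features_pos)), -np.ones(len(features_neg))])
    model = svm_classify(x, y)

    # slide the learned template over each test image at multiple scales;
    # duplicate detections are removed by non-maximum suppression inside run_detector
    bboxes, confidences, image_ids = run_detector(test_scn_path, model, feature_params)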

## Implementation

  1. get_positive_features.py
    • goal: Load the positive training images and convert them to HoG features
    • implementation:

    Traverse every file in train_path_pos ending with .jpg, read it as grayscale, and flatten its HoG descriptor into a feature vector of length D = (template_size / hog_cell_size)² · 31 (e.g. with a 36-pixel template and 6-pixel cells, D = 6² · 31 = 1116).

        # listdir comes from os; imread and hog are the image-reading and HoG helpers
        # already imported in the starter code
        image_files = [f for f in listdir(train_path_pos) if f.endswith('.jpg')]
        num_img = len(image_files)
        # each HoG cell contributes 31 values, and a template covers
        # (template_size / hog_cell_size)^2 cells
        D = int(pow(feature_params['template_size'] / feature_params['hog_cell_size'], 2) * 31)
        features_pos = np.zeros((num_img, D))
        for i in range(num_img):
            path = train_path_pos + '/' + image_files[i]
            img = imread(path, as_grey=True)   # read as grayscale
            # flatten the whole-template HoG descriptor into one row
            features_pos[i] = np.reshape(hog(img, feature_params['hog_cell_size']), (-1,))
    
  2. get_random_negative_features.py
    • goal: Randomly choose negative examples from scenes that contain no faces and convert them to HoG features
    • implementation:

    According to the instructions in the code: this function should return negative training examples (non-faces) from any images in 'non_face_scn_path'; images should be converted to grayscale.

        # ...inside a loop over all non-face images, each read as grayscale
        # exactly as in get_positive_features.py
        t = feature_params['template_size']
        # sample at most smp_per_img patches, but never more than the image allows
        num_sampd = min(smp_per_img, len(img) - t, len(img[0]) - t)
        # randomly choose num_sampd row and column indices for the patch corners
        idx_i = np.random.choice(np.arange(len(img) - t), num_sampd, replace=False)
        idx_j = np.random.choice(np.arange(len(img[0]) - t), num_sampd, replace=False)
        for x in range(num_sampd):
            # crop a template-sized patch at the random position and take its HoG features
            portion = img[idx_i[x]:idx_i[x] + t, idx_j[x]:idx_j[x] + t]
            hogged = hog(portion, feature_params['hog_cell_size'])
            port = np.reshape(hogged, (1, -1))
            # all_images is the running (N, D) matrix of negative features
            all_images = np.concatenate([all_images, port], axis=0)
    
  3. svm_classify.py
    • goal: Train a linear SVM classifier from both the positive and negative features
    • implementation:

    Try the regularization constant C over 0.1, 0.01, 0.001 and 0.0001; C = 0.01 is used in the code below (a sketch of how these values can be compared follows the list).

        # linear SVM with the chosen regularization constant
        clf = svm.LinearSVC(C=0.01)
        y = np.ravel(y)   # flatten the labels into a 1-D array
        clf.fit(x, y)     # x: stacked positive and negative HoG features
    
  4. run_detector.py
    • goal: Run the classifier on the test set. For each image, run the classifier at multiple scales and then call non_max_supr_bbox.py to remove duplicate detections
    • implementation:

    According to the instructions in the code: convert each test image to HoG feature space with a single call to vl_hog for each scale, then step over the HoG cells, taking groups of cells that are the same size as the learned template and classifying them. If the classification is above some confidence, keep the detection and pass all the detections for an image to non-maximum suppression (see the sketch after this list).

        # ...inside a loop over all test images, each read as grayscale as in
        # get_positive_features.py; the elided setup defines count (current scale factor),
        # mindim (smaller image dimension), cell_size (= hog_cell_size) and
        # cell_num (cells per template side)
        while count * mindim > feature_params['template_size']:
            # rescale the image so the fixed-size template scans a new scale
            frame = resize(img, [int(len(img) * count), int(len(img[0]) * count)])
            # convert the whole rescaled test image to HoG features with a single call
            hogged = hog(frame, cell_size)
            D = int(pow(cell_num, 2) * 31)   # flattened template dimension, as in step 1
            # step over the HoG cells
            for k in range(int(len(hogged) - cell_num + 1)):
                for j in range(int(len(hogged[0]) - cell_num + 1)):
                    curr_y_min = k
                    curr_x_min = j
                    # take a group of cells the same size as the learned template
                    mini_hog = hogged[curr_y_min:curr_y_min + cell_num, curr_x_min:curr_x_min + cell_num]
                    bfeat = np.reshape(mini_hog, (1, -1))
                    tmp_score = np.reshape(model.decision_function(bfeat), (1, -1))
                    # keep the detection if the classification score is above the confidence threshold (0 here)
                    if tmp_score[0, 0] > 0:
                        # map the window back to original-image coordinates
                        # (rowS/rowE come from the column index j, colS/colE from the row index k)
                        rowS = int(j * cell_size / count)
                        rowE = int((j + cell_num) * cell_size / count)
                        colS = int(k * cell_size / count)
                        colE = int((k + cell_num) * cell_size / count)
                        # collect the detections; they are passed to non-maximum suppression
                        # once the whole image has been processed
                        cur_bboxes = np.concatenate([cur_bboxes, np.array([[rowS, colS, rowE, colE]])], axis=0)
                        cur_image_ids = np.concatenate([cur_image_ids, [[test_images[i]]]], axis=0)
                        cur_confidences = np.concatenate([cur_confidences, tmp_score], axis=0)
            # shrink by a constant factor to cover multiple scales; factors of
            # 0.999, 0.99, 0.95, 0.9 and 0.85 were tried (the larger, the slower)
            count *= 0.9
    
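The C values listed in step 3 can be compared empirically. The snippet below is only an illustration of that comparison, assuming features_pos and features_neg are the feature matrices from steps 1 and 2 and that faces are labelled +1 and non-faces -1; the held-out validation split is not part of the original pipeline.

    import numpy as np
    from sklearn import svm
    from sklearn.model_selection import train_test_split

    # stack positives and negatives with +1 / -1 labels (labelling assumed)
    x = np.vstack([features_pos, features_neg])
    y = np.hstack([np.ones(len(features_pos)), -np.ones(len(features_neg))])

    # hold out 20% of the training data just to compare the candidate C values
    x_tr, x_val, y_tr, y_val = train_test_split(x, y, test_size=0.2, random_state=0)

    for c in [0.1, 0.01, 0.001, 0.0001]:
        clf = svm.LinearSVC(C=c)
        clf.fit(x_tr, y_tr)
        print(c, clf.score(x_val, y_val))   # validation accuracy for each C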

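In step 4, once the sliding-window loop has finished for one image, the collected boxes are handed to non-maximum suppression. The signature of the starter code's non_max_supr_bbox.py is not shown above, so the call below is an assumption: it is sketched as taking the boxes, their confidences and the image shape, and returning a mask of detections to keep; the accumulator names for the overall detector outputs are illustrative.

    import numpy as np
    from non_max_supr_bbox import non_max_supr_bbox   # helper shipped with the starter code

    # keep only locally-maximal detections for the current image
    # (assumed signature: boxes, confidences, image shape -> mask of kept boxes)
    is_maximum = non_max_supr_bbox(cur_bboxes, cur_confidences, img.shape)
    is_maximum = np.ravel(is_maximum).astype(bool)   # flatten in case a column is returned

    cur_bboxes = cur_bboxes[is_maximum]
    cur_confidences = cur_confidences[is_maximum]
    cur_image_ids = cur_image_ids[is_maximum]

    # append the surviving detections to the detector's overall outputs
    bboxes = np.concatenate([bboxes, cur_bboxes], axis=0)
    confidences = np.concatenate([confidences, cur_confidences], axis=0)
    image_ids = np.concatenate([image_ids, cur_image_ids], axis=0)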
## Installation

  • Other required packages: math, os.path, sklearn.svm
  • How to run: python proj4.py

## Results

Precision Recall curve for the modified code.

Face template HoG visualization for the starter code. This is completely random, but it should actually look like a face once you train a reasonable classifier.

Precision Recall curve for the starter code.

Example of detection on the test set from the starter code.