PhaseCam3D

Project | Video | Paper

This repository contains TensorFlow implementation for the ICCP2019 paper PhaseCam3D — Learning Phase Masks for Passive Single View Depth Estimation by Yicheng Wu, Vivek Boominathan, Huaijin Chen, Aswin Sankaranarayanan, and Ashok Veeraraghavan.

Installation

Clone this repo.

git clone https://github.com/YichengWu/PhaseCam3D
cd PhaseCam3D/

The code is developed using Python 3.7.1 and TensorFlow 1.13.0. The GPU we used is NVIDIA GTX 1080 Ti (11G). Change batch_size accordingly if you run the code with different GPU memory.

Dataset

The dataset is modified based on FlyingThings3D in Scene Flow Datasets. The pre-processed TFrecord files can be downloaded from Google Drive. It contains 5077 training patches, 553 validation patches, and 419 test patches.

Train

To train the entire framework, simply run the following code.

python depth_estimation.py

Inside the code, results_dir is the output result directory, DATA_PATH is the directory of the downloaded dataset. The learning rate of the optical layer and digital network can be set individually using lr_optical and lr_digital. Detailed instruction about choosing the learning rate can be found in paper Sec. IIID(d).

Logging

We use Tensorboard for logging training progress. Recommended: execute tensorboard --logdir /path/to/save_dir --port 9001 and visit localhost:9001 in the browser.

Evaluation

Once the network is trained, the performance can be evaluated using the testing dataset.

python depth_estimation_test.py

Change results_dir to the place you save your model. Once the testing is finished, a new folder called test_all will be created inside the model directory. It contains 400 scenes, and each one includes a clean image sharp.png, a coded image blur.png, an estimated disparity map phiHat.png, and a ground truth disparity map phiGT.png.

If you want to see the performance of our best result, please download from Google Drive. Sample images are shown below.

Ablation study

We did a comprehensive ablation study by varying the learning rate, initialization, and loss. All the results are listed here. Please let us know if you outperform our results!

Exp.	Learn mask	Initialization	Loss	Error (RMS)
A	No	No mask	RMS	2.69
B	Yes	Random	RMS	1.07
C	No	Fisher mask	RMS	0.97
D	Yes	Random	RMS+CRLB	0.88
E	Yes	Fisher mask	RMS	0.74
F	Yes	Fisher mask	RMS+CRLB	0.85
G	Yes	Fisher mask	RMS+gradient	0.56

Citation

If you use this code for your research, please cite our papers.

@inproceedings{wu2019phasecam3d,
  title={PhaseCam3D—Learning Phase Masks for Passive Single View Depth Estimation},
  author={Wu, Yicheng and Boominathan, Vivek and Chen, Huaijin and Sankaranarayanan, Aswin and Veeraraghavan, Ashok},
  booktitle={2019 IEEE International Conference on Computational Photography (ICCP)},
  pages={1--12},
  year={2019},
  organization={IEEE}
}

Contributions

If you have any questions/comments/bug reports, feel free to open a github issue or pull a request or e-mail to the author Yicheng Wu ([email protected]).

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
docs		docs
figures		figures
.gitignore		.gitignore
FisherMask_HeightMap.txt		FisherMask_HeightMap.txt
Network.py		Network.py
README.md		README.md
depth_estimation.py		depth_estimation.py
depth_estimation_test.py		depth_estimation_test.py
zernike_basis.mat		zernike_basis.mat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PhaseCam3D

Project | Video | Paper

Installation

Dataset

Train

Logging

Evaluation

Ablation study

Citation

Contributions

About

Releases

Packages

Languages

YichengWu/PhaseCam3D

Folders and files

Latest commit

History

Repository files navigation

PhaseCam3D

Project | Video | Paper

Installation

Dataset

Train

Logging

Evaluation

Ablation study

Citation

Contributions

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages