how to use intrinsics and extrinsics from colmap? #21

Closed
Jiakui opened this issue Oct 12, 2022 · 5 comments

Comments

@Jiakui

Jiakui commented Oct 12, 2022

Hi,

I have a custom image dataset and ran COLMAP to get the intrinsics and extrinsics of all the images. How should I then proceed to run monosdf and get the mesh?

Thanks!

Jiakui

@niujinshuchong
Member

Hi, I think you could adapt this script https://github.com/autonomousvision/monosdf/blob/main/preprocess/scannet_to_monosdf.py to prepare the data.
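For reference, a minimal sketch of how the COLMAP text export (cameras.txt / images.txt) could be read into the fx, fy, cx, cy and camera-to-world poses used in the snippet further down. It assumes a single PINHOLE camera and a typical colmap/sparse/0/ output path; it is not the loader used by the script above.

    import numpy as np

    def qvec_to_rotmat(q):
        # COLMAP stores rotations as unit quaternions (qw, qx, qy, qz)
        w, x, y, z = q
        return np.array([
            [1 - 2*y*y - 2*z*z,     2*x*y - 2*z*w,     2*x*z + 2*y*w],
            [    2*x*y + 2*z*w, 1 - 2*x*x - 2*z*z,     2*y*z - 2*x*w],
            [    2*x*z - 2*y*w,     2*y*z + 2*x*w, 1 - 2*x*x - 2*y*y]])

    # cameras.txt: CAMERA_ID MODEL WIDTH HEIGHT PARAMS[...]
    with open("colmap/sparse/0/cameras.txt") as f:
        for line in f:
            if line.startswith("#"):
                continue
            elems = line.split()
            w, h = int(elems[2]), int(elems[3])
            fx, fy, cx, cy = map(float, elems[4:8])   # PINHOLE: fx fy cx cy
            break

    # images.txt: two lines per image, the first one holds the pose
    poses = []
    with open("colmap/sparse/0/images.txt") as f:
        lines = [l for l in f if not l.startswith("#")]
    for line in lines[::2]:
        elems = line.split()
        qvec = np.array(list(map(float, elems[1:5])))
        tvec = np.array(list(map(float, elems[5:8])))
        R = qvec_to_rotmat(qvec)        # world-to-camera rotation
        c2w = np.eye(4)
        c2w[:3, :3] = R.T               # camera-to-world, i.e. what the code below calls `pose`
        c2w[:3, 3] = -R.T @ tvec
        poses.append(c2w)
    poses = np.stack(poses)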

@CCOSerika

Hi author,
I ran COLMAP and used the following code to generate 'cameras.npz'.

    # 4x4 intrinsics built from the COLMAP pinhole parameters
    intrin_mat = np.array([
        [fx,  0, cx],
        [ 0, fy, cy],
        [ 0,  0,  1]
    ])
    # center-crop to a square and rescale to the 384x384 training resolution
    intrin_mat[0, 2] -= (w - min(w, h)) / 2.
    intrin_mat[1, 2] -= (h - min(w, h)) / 2.
    intrin_mat[:2, :] *= 384. / min(w, h)

    K = np.eye(4, dtype=np.float32)
    K[:3, :3] = intrin_mat

    # normalize the scene using the bounding box of the camera centers
    valid_poses = np.isfinite(poses).all(axis=2).all(axis=1)
    min_vertices = poses[:, :3, 3][valid_poses].min(axis=0)
    max_vertices = poses[:, :3, 3][valid_poses].max(axis=0)
    scale_mat = np.eye(4, dtype=np.float32)
    scale_mat[:3, 3] = -(min_vertices + max_vertices) / 2.
    scale_mat[:3] *= 2. / (np.max(max_vertices - min_vertices) + 3.)
    scale_mat = np.linalg.inv(scale_mat)   # stored matrix maps normalized space back to world

    # world_mat is K @ world-to-camera; poses are camera-to-world, so invert them
    cams = {}
    idx = 0
    for pose, valid in zip(poses, valid_poses):
        if not valid:
            continue
        pose = K @ np.linalg.inv(pose)
        cams[f"scale_mat_{idx:d}"] = scale_mat
        cams[f"world_mat_{idx:d}"] = pose
        idx += 1
    np.savez(osp.join(path, "cameras.npz"), **cams)

It seems that the losses are converging, but the extracted mesh is just strange.
[screenshot 221012_00]
The normal training result is as follows.
[screenshot 221012_01]
I noticed the code below in 'scene_dataset.py' and commented it out, but the result is still bad.

elif center_crop_type == 'center_crop_for_dtu':
    scale = 384 / 1200
    offset = (1600 - 1200) * 0.5
    intrinsics[0, 2] -= offset
    intrinsics[:2, :] *= scale

I guess the issue lies in the transformation applied to the camera poses.
Do you have any idea?

@niujinshuchong
Member

@CCOSerika Sorry for the late reply.
This line

scale_mat[:3] *= 2. / (np.max(max_vertices - min_vertices) + 3.) 

is used for ScanNet (indoor scenes), where the cameras are inside the rooms. In the DTU case it should be changed to something like

scale_mat[:3] *= 3. / (np.max(max_vertices - min_vertices))
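Putting the two normalizations side by side, here is a minimal sketch contrasting the indoor variant from the snippet above with the object-centric one suggested here; scene_type is only an illustrative flag, not an option in the preprocessing script.

    centers = poses[:, :3, 3][valid_poses]                   # camera centers in world space
    min_vertices, max_vertices = centers.min(axis=0), centers.max(axis=0)

    scale_mat = np.eye(4, dtype=np.float32)
    scale_mat[:3, 3] = -(min_vertices + max_vertices) / 2.   # recenter the scene

    if scene_type == "indoor":
        # ScanNet-style: cameras are inside the room, add padding so the walls still fit
        scale_mat[:3] *= 2. / (np.max(max_vertices - min_vertices) + 3.)
    else:
        # DTU-style: cameras surround the object, scale from the camera spread alone
        scale_mat[:3] *= 3. / np.max(max_vertices - min_vertices)

    # cameras.npz stores the inverse, which maps the normalized cube back to world space
    scale_mat = np.linalg.inv(scale_mat)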

@Thermaloo

Sorry, I want to ask: is the pose conversion only used to generate the mesh in the evaluation phase?

@niujinshuchong
Member

@MingRuiye Sorry, I missed your messages. I'm not sure whether I understand your question. The pose conversion is used in the evaluation phase because we need to convert the extracted mesh back to the original world space.
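For illustration, a minimal sketch of that back-conversion, assuming the mesh was extracted in the normalized coordinates, trimesh is available, and the file names are placeholders; scale_mat_0 comes from the cameras.npz written above.

    import numpy as np
    import trimesh

    cams = np.load("cameras.npz")
    scale_mat = cams["scale_mat_0"]        # normalized cube -> original world space

    mesh = trimesh.load("mesh_normalized.ply")
    mesh.apply_transform(scale_mat)        # move vertices back to the COLMAP/world frame
    mesh.export("mesh_world.ply")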
