Add depth estimation pipeline #18446

NielsRogge · 2022-08-03T10:35:57Z

Feature request

We currently have 2 monocular depth estimation models in the library, namely DPT and GLPN.

It would be great to have a pipeline for this task, with the following API:

from transformers import pipeline

pipe = pipeline("depth-estimation")
pipe("cats.png")

This pipeline could default to the https://huggingface.co/Intel/dpt-large checkpoint. Also check out the Space that showcases the model.

This can be implemented similar to other pipelines. For an example PR that added a pipeline, see #11598.

Motivation

Pipelines are a great way to quickly perform inference with a model for a given task, abstracting away all the complexity.

Your contribution

I can assist with this, together with @Narsil.

The text was updated successfully, but these errors were encountered:

Narsil · 2022-08-03T10:48:45Z

What would be the output like @NielsRogge ?

My understanding is that depth is just a gray scale image (black = infinitely far, white = infinitely close).

If that's the case It seems really close to image-segmentation in the sense that it's generating a new image from the original image, so we should try and reuse as much as possible.

Also maybe we could have something like image-generation to try and keep the name generic ? (And have an alias for depth-estimation for instance ?)

nandwalritik · 2022-08-04T09:21:05Z

Hi @NielsRogge I would like to add this pipeline.

NielsRogge · 2022-08-04T09:29:32Z

Hi @Narsil,

I'm not sure whether we should add this to the existing image-segmentation pipeline. Depth estimation is basically pixel regression, rather than pixel classification (the latter is image segmentation). It would be quite confusing to add it there.

Depth estimation is quite a different field, see e.g. https://paperswithcode.com/task/depth-estimation

And hi @nandwalritik, thanks for your interest in this. Feel free to start a draft PR.

nandwalritik · 2022-08-04T09:35:07Z

Thanks I will start working on it.

Narsil · 2022-08-04T12:12:58Z

I'm not sure whether we should add this to the existing image-segmentation pipeline.

I said we should inspire from it, not reuse it, but I suggested using an image-generationone. (Just to be slightly more general)
The output is a grayscale image, right ?

NielsRogge added the Good First Issue label Aug 3, 2022

nandwalritik mentioned this issue Aug 14, 2022

Add depth estimation pipeline #18618

Merged

4 tasks

sgugger closed this as completed in #18618 Oct 12, 2022

NielsRogge mentioned this issue Jan 13, 2023

Add support for BLIP and GIT in image-to-text and VQA pipelines #21110

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add depth estimation pipeline #18446

Add depth estimation pipeline #18446

NielsRogge commented Aug 3, 2022 •

edited

Loading

Narsil commented Aug 3, 2022

nandwalritik commented Aug 4, 2022

NielsRogge commented Aug 4, 2022 •

edited

Loading

nandwalritik commented Aug 4, 2022

Narsil commented Aug 4, 2022 •

edited

Loading

Add depth estimation pipeline #18446

Add depth estimation pipeline #18446

Comments

NielsRogge commented Aug 3, 2022 • edited Loading

Feature request

Motivation

Your contribution

Narsil commented Aug 3, 2022

nandwalritik commented Aug 4, 2022

NielsRogge commented Aug 4, 2022 • edited Loading

nandwalritik commented Aug 4, 2022

Narsil commented Aug 4, 2022 • edited Loading

NielsRogge commented Aug 3, 2022 •

edited

Loading

NielsRogge commented Aug 4, 2022 •

edited

Loading

Narsil commented Aug 4, 2022 •

edited

Loading