New update is much faster (everything fits easier into VRAM), but causes artifacts and incorrect depth when going too high #141
Replies: 6 comments 7 replies
-
Here's an example showing the map produced at my previous limit (right at the edge of getting a CUDA ran out of memory error).. 1904 resolution in my case for a square image... compared to 3840 with this new update. We can go higher in numbers, but it goes downhill in quality. |
Beta Was this translation helpful? Give feedback.
-
some things, like you might look at the face and think it's more detailed, but pay attention to all details.. like her right earring in now darker, background instead of closer to the viewer as it should be.. there are background holes in her arms.. parts of her shoes are turning into background.. the grating or whatever it is at her knee level is much worse looking.. an artifact where the light is shining to the right of her.. the ground is no longer a smooth gradient, etc. I know my setup's pixel limit (or maybe it's the limit of these models).. We need to figure out if this is a limitation of the models or not and code in a total pixel limitation if it is It sounds like it may be universal, as nagadomi.. you mentioned things appearing broken over 2048 or something along those lines.. it was likely becoming broken slightly before that too |
Beta Was this translation helpful? Give feedback.
-
it's a total pixel limit.. something between 3,631,096 and 3,701,852 pixels for example 1904x1904 is 3,625,216 pixels and under this limit, so it produces an amazing result, as detailed and artifact free as possible different aspect ratios will be a lower number.. like 16:9 aspect ratio limit is likely 1428 resolution... which would produce a map of around 1428x2539 = 3,625,692 pixels another one of my tests in a different aspect ration maxed out at 1652x2198... 3,631,096 total pixels |
Beta Was this translation helpful? Give feedback.
-
@nagadomi ... here's how I calculate how to find the limit (at least this is how it works for my setup, but I'm thinking it may be universal / a limit of these Any v2 models... "original image res is 1920x804... final result 2940x1232 = 3,622,080 pixels that's a calculation I did for the attached example |
Beta Was this translation helpful? Give feedback.
-
This is a known problem that quality goes down with too high resolution input. BoostingMonocularDepth discusses this problem. |
Beta Was this translation helpful? Give feedback.
-
my speed went up! It's awesome! |
Beta Was this translation helpful? Give feedback.
-
I was lucky to find my card's maximum just before the update. I found mine maxes out around 3.7 million total pixels for the depth resolution (exact height and width depends on aspect ratio, but 3.7mil total pixel count)...
This maximum was very slow, running at a snail's pace before the update, but now it all stays in VRAM and runs fast.. like in 1 second instead of 30 seconds to 1 minute+ previously
so my max was something like 1904x1904 for a square image using AnyL v2. Now if I use anything over that.. I get the result, but it has problems. Artifacts are introduced and depth becomes incorrect
We need to somehow know what our limits were prior to this recent update. I had luckily just tested mine.. others have no idea. Others will push very high resolution, because it's easy now, but it will give worse results than some lower resolution.
maybe what I found is universal for all cards and that 3.6-3.7 million limit is the limitation of these new models. If that's the case, then that needs to be verified and then we (aka nagadomi) can try to code in a pixel limit for the Any v2 models. People may be limited otherwise due to hardware
Beta Was this translation helpful? Give feedback.
All reactions