
Add separate config for predict batch size and train batch size. #889

Open
bw4sz opened this issue Jan 16, 2025 · 2 comments
Labels
good first issue Good for newcomers

Comments

@bw4sz
Collaborator

bw4sz commented Jan 16, 2025

Updating model weights takes a lot more GPU memory than a forward pass alone.

predict_tile is slower than it needs to be because it uses trainer.predict, which inherits a dataloader whose batch size is set by the global config:

batch_size=self.config["batch_size"],

while in training the batch size comes from load_dataset:

batch_size=self.config["batch_size"])

The default is 1 because the available GPU memory during training is unknown (it should probably be 2).

  1. Add a predict_batch_size and a train_batch_size config arg.
  2. Update the defaults to 2 for train and 8 for predict.
  3. Update the config docs.
  4. Write tests showing that the dataloaders for each are yielding the correct batch sizes.
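The step 4 test could be sketched independently of DeepForest internals. This is a minimal sketch using plain PyTorch: the `predict_batch_size`/`train_batch_size` keys follow the proposal above (they don't exist in the config yet), and `TensorDataset` is a stand-in for a real tile dataset.

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Hypothetical config with the two proposed keys (defaults from the proposal).
config = {"train_batch_size": 2, "predict_batch_size": 8}

# Stand-in for a tile dataset: 32 samples of 3 features each.
dataset = TensorDataset(torch.zeros(32, 3))

train_loader = DataLoader(dataset, batch_size=config["train_batch_size"])
predict_loader = DataLoader(dataset, batch_size=config["predict_batch_size"])

# Each dataloader should yield batches of its own configured size.
assert next(iter(train_loader))[0].shape[0] == 2
assert next(iter(predict_loader))[0].shape[0] == 8
```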

I'm unsure about the val dataloader batch size; it could perhaps be higher, since the GPU memory use there isn't clear to me. I think the val batch size should match the predict batch size, since no weights are updated during validation.
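If validation does reuse the predict batch size, the stage-to-batch-size mapping could look like the following sketch. `resolve_batch_size` is a hypothetical helper, not an existing DeepForest function, and the default of 8 follows the proposal above.

```python
# Hypothetical helper (not DeepForest API): pick a batch size per stage.
# Validation reuses the predict batch size because no weights are updated,
# so its GPU memory profile is closer to prediction than to training.
def resolve_batch_size(stage: str, config: dict) -> int:
    if stage == "train":
        return config["train_batch_size"]
    # "validate" and "predict" share the larger forward-pass-only batch size
    return config.get("predict_batch_size", 8)

print(resolve_batch_size("train", {"train_batch_size": 2, "predict_batch_size": 8}))     # 2
print(resolve_batch_size("validate", {"train_batch_size": 2, "predict_batch_size": 8}))  # 8
```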

@bw4sz bw4sz added the good first issue Good for newcomers label Jan 16, 2025
@rabelmervin

Hi @bw4sz,
I saw this issue and thought it looked really interesting! I would like to contribute to this. Any guidance would be appreciated :)

@bw4sz
Collaborator Author

bw4sz commented Jan 19, 2025

Go for it. Do you have access to a GPU? I'm not yet sure if validation batch_size and predict_batch_size should be the same or separate arguments. Make sure to profile the example code. If you need a large tile to test on (you won't notice much difference on the sample package data), here is one:

https://www.dropbox.com/scl/fi/yki42nmplok43isi1queb/2021_TEAK_5_322000_4097000_image.tif?rlkey=aaq4sc3jqa13oo4axuh0vw93d&dl=0

import time
from deepforest import main

def profile_predict_tile(batch_sizes, raster_path):
    # Load a pretrained tree-detection model
    model = main.deepforest()
    model.load_model(model_name="weecology/deepforest-tree")

    # Time predict_tile at each batch size
    for batch_size in batch_sizes:
        model.config["batch_size"] = batch_size
        start_time = time.time()
        model.predict_tile(raster_path=raster_path, patch_size=300, patch_overlap=0.25)
        elapsed = time.time() - start_time
        print(f"Batch Size: {batch_size}, Time Taken: {elapsed:.2f} seconds")

if __name__ == "__main__":
    raster_path = "<path_to_raster>"  # replace with the path to the downloaded tile
    batch_sizes = [1, 2, 4, 8, 16]
    profile_predict_tile(batch_sizes, raster_path)
