
Question for speed up #3

Open
jameslahm opened this issue Mar 24, 2023 · 1 comment

@jameslahm

Thank you for your great work! In Table 2 of the paper, I see that pruning DeiT-Tiny increases the throughput from 2648.7 to 4496.2 images/s.
But in my local test, I found that the pruned DeiT-Tiny's throughput (1819) is similar to the original DeiT-Tiny's (1760). I use the provided compressed DeiT-Tiny model (Acc@1: 71.6, https://drive.google.com/file/d/1NSq3SRxnObfl6oaFE5gHtjnhzm0Lfc6S/view?usp=sharing). My environment is an RTX 3090, and my throughput code is below:

import time

import torch

@torch.no_grad()
def throughput(data_loader, model, local_rank):
    model.eval()

    # Measure on the first batch only.
    for idx, (images, _) in enumerate(data_loader):
        images = images.cuda(non_blocking=True)
        batch_size = images.shape[0]

        # Warm-up runs so CUDA kernels are compiled and caches are hot.
        for i in range(50):
            model(images)
        torch.cuda.synchronize()

        # Timed runs; synchronize before reading the clock so all queued
        # GPU work is included in the measurement.
        tic1 = time.time()
        for i in range(30):
            model(images)
        torch.cuda.synchronize()
        tic2 = time.time()

        throughput = 30 * batch_size / (tic2 - tic1)
        if local_rank == 0:
            print("throughput averaged over 30 runs")
            print(f"batch_size {batch_size} throughput {throughput:.1f}")
        return

I wonder if I did something wrong. Would you mind sharing your code for testing throughput? Thanks a lot.

@Daner-Wang (Owner)

Thank you for your comments. In our evaluation, we estimate throughput by timing only the MHSA and FFN modules in the model. We ran your code for comparison and found that your results are likely affected by the token selection function, which is not yet well optimized. Thank you for helping us find this problem; we will try to optimize this function to make it faster.
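For readers wondering what "timing only the MHSA and FFN modules" might look like, here is a minimal sketch of that measurement style. This is not the repository's actual benchmarking code; the helper names (`time_callable`, `estimate_throughput`) are illustrative, and the toy modules below stand in for real attention/FFN blocks (on GPU you would also call `torch.cuda.synchronize()` around the timed region, as in the snippet above).

```python
import time

def time_callable(fn, inputs, warmup=50, runs=30):
    """Return the average seconds per call of fn(inputs),
    after a warm-up phase."""
    for _ in range(warmup):
        fn(inputs)
    tic = time.perf_counter()
    for _ in range(runs):
        fn(inputs)
    toc = time.perf_counter()
    return (toc - tic) / runs

def estimate_throughput(modules, inputs, batch_size, warmup=50, runs=30):
    """Sum the per-module times (e.g. over every MHSA and FFN block)
    and convert the total per-forward time to images per second."""
    total = sum(time_callable(m, inputs, warmup, runs) for m in modules)
    return batch_size / total

# Toy stand-ins for MHSA/FFN blocks:
blocks = [lambda x: [v * 2 for v in x], lambda x: [v + 1 for v in x]]
print(estimate_throughput(blocks, [0.0] * 8, batch_size=8))
```

Note that summing isolated module times ignores everything outside those modules (token selection, residual adds, data movement), which is exactly why an end-to-end measurement like the one in the question can report lower numbers.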
