Samplers always return the same number of batches in distributed mode #267
Conversation
lhotse/dataset/sampling.py
Outdated
```
The formula used to determine which batches are returned is
``(batch_idx + rank) % world_size == 0``.
This ensures that we can return an equal number of batches in all distributed workers
in spite of using a dynamic batch size, at the cost of skipping at most ``world_size`` batches.
```
Should it be "skipping at most ``world_size - 1`` batches"?
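For intuition, here is a minimal sketch (illustrative only, not lhotse code) of the equalization argument: each rank keeps the batch indices matched by the docstring's formula and truncates to a common per-rank length, so the number of dropped batches is ``num_batches % world_size``, which is at most ``world_size - 1``.

```python
# Sketch of the equal-batch-count argument (illustrative, not lhotse code).
# Each rank keeps the batch indices matched by the docstring's formula,
# then truncates to a common per-rank length so all workers stay in sync.

def kept_batches(num_batches: int, rank: int, world_size: int) -> list[int]:
    kept = [b for b in range(num_batches) if (b + rank) % world_size == 0]
    # Truncating to num_batches // world_size equalizes the ranks; the
    # remainder, num_batches % world_size <= world_size - 1, is skipped.
    return kept[: num_batches // world_size]

if __name__ == "__main__":
    num_batches, world_size = 10, 4
    for rank in range(world_size):
        print(rank, kept_batches(num_batches, rank, world_size))
    # Every rank yields exactly 2 batches; 10 - 4 * 2 = 2 batches are
    # skipped, within the world_size - 1 = 3 bound.
```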
lhotse/dataset/sampling.py
Outdated
```
DistributedSampler -- instead of partitioning the underlying cuts into equally sized chunks,
it will return every N-th batch and skip the other batches (where ``N == world_size``).
The formula used to determine which batches are returned is
``(batch_idx + rank) % world_size == 0``.
```
Should it be ``(batch_idx + (world_size - rank)) % world_size == 0``?
yes, you're right, thanks @danpovey @csukuangfj
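To see the difference the sign makes, here is a quick illustrative check (a sketch, not lhotse code): both variants split the batches into disjoint 1/``world_size`` slices, but only the corrected formula reduces to ``batch_idx % world_size == rank``, so rank ``r`` starts at batch ``r`` rather than batch ``world_size - r``.

```python
# Compare the two rank offsets (a sketch, not lhotse code). Both give each
# rank a disjoint 1/world_size slice of the batches, but only the corrected
# formula assigns batch_idx == rank as the first batch of each rank.

world_size = 4
batches = range(12)

for rank in range(world_size):
    original = [b for b in batches if (b + rank) % world_size == 0]
    corrected = [b for b in batches if (b + (world_size - rank)) % world_size == 0]
    # The corrected condition is equivalent to batch_idx % world_size == rank.
    assert corrected == [b for b in batches if b % world_size == rank]
    print(f"rank {rank}: original={original} corrected={corrected}")

# rank 0: original=[0, 4, 8]  corrected=[0, 4, 8]
# rank 1: original=[3, 7, 11] corrected=[1, 5, 9]
# rank 2: original=[2, 6, 10] corrected=[2, 6, 10]
# rank 3: original=[1, 5, 9]  corrected=[3, 7, 11]
```

Since Python's ``%`` returns a non-negative result even for a negative left operand, ``(batch_idx - rank) % world_size == 0`` would be an equivalent way to write the corrected condition.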