[Feature] Deprecate batch_size in favor of gradient accumulation steps
#121
Labels: enhancement
Reason: It makes the calculation easier.
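To illustrate the calculation in question, here is a minimal sketch. It assumes the usual relationship where the effective batch size is the per-step micro batch size multiplied by the number of gradient accumulation steps (and by the device count, if any). The function and parameter names (`effective_batch_size`, `accumulation_steps_from_batch_size`, `micro_batch_size`, `num_devices`) are hypothetical and not taken from this project's code.

```python
def effective_batch_size(
    micro_batch_size: int,
    gradient_accumulation_steps: int,
    num_devices: int = 1,
) -> int:
    """Samples contributing to one optimizer step (assumed relationship)."""
    return micro_batch_size * gradient_accumulation_steps * num_devices


def accumulation_steps_from_batch_size(batch_size: int, micro_batch_size: int) -> int:
    """Derive accumulation steps from a batch_size setting (single device).

    This is the extra division a user or the code has to do when only
    batch_size is configured; it fails when the two values do not divide evenly.
    """
    if batch_size % micro_batch_size != 0:
        raise ValueError("batch_size must be a multiple of micro_batch_size")
    return batch_size // micro_batch_size
```

Setting the accumulation steps directly would avoid the divisibility check in the second function, which may be the simplification the request has in mind.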