Text this: On the Utility of Equal Batch Sizes for Inference in Stochastic Gradient Descent