Similar Items: Efficient Training on Multiple Consumer GPUs with RoundPipe