Similar Items: Synthetic data through combinatorial optimization of pairwise probabilities