Similar Items: Toward Causal Field Evaluations of AI Systems