Text this: Output format biases in the evaluation of large language models for code translation