Similar Items: Output format biases in the evaluation of large language models for code translation
- An evaluation study of large language models for addressing code quality issues
- ArkTS code generation: A comprehensive evaluation with large language models
- Evaluating large language models for multilingual vulnerability detection at dual granularities
- HAFix: history-augmented large language models for bug fixing
- Byam: Fixing Breaking Dependency Updates with Large Language Models
- Peer-aided repairer: empowering large language models to repair advanced student assignments