Similar Items: CDBench: Benchmarking the mutation testing capabilities of LLMs with code defenders
- Multi-scenario benchmark for autonomous driving systems: Exposing diverse behavioral anomalies
- Meta-enhanced code: leveraging structural and functional features for precise cross-modal code search
- A multi-language perspective on the robustness of LLM code generation
- Exploring and improving knowledge distillation for pre-trained code models
- On the emergence of testing strategies: A socio-technical grounded theory
- An empirical evaluation of white-box and black-box test case prioritization techniques in CPSs modeled in Simulink