Channels - CDBench: Benchmarking the mutation testing capabilities of LLMs with code defenders :: FRELIP Discovery

Similar Items: CDBench: Benchmarking the mutation testing capabilities of LLMs with code defenders

Quick Look
Multi-scenario benchmark for autonomous driving systems: Exposing diverse behavioral anomalies
Quick Look
Meta-enhanced code: leveraging structural and functional features for precise cross-modal code search
Quick Look
A multi-language perspective on the robustness of LLM code generation
Quick Look
Exploring and improving knowledge distillation for pre-trained code models
Quick Look
On the emergence of testing strategies: A socio-technical grounded theory
Quick Look
An empirical evaluation of white-box and black-box test case prioritization techniques in CPSs modeled in Simulink
Quick Look
Echoes of AI: Investigating the downstream effects of AI assistants on software maintainability
Quick Look
LibreOffice 26.4 Beta Experiments with AI Writing Features and Smarter Editing Tools
Quick Look
Linux 7.1-rc2 Released with Driver Fixes, Steam Deck OLED Audio Repair, and Growing AI Patch Trends
Quick Look
PRaFFLineDP: Feature fusion with progressive ranking for efficient line-level defect prediction
Quick Look
Ubuntu 26.10 Development Officially Begins as ‘Stonking Stingray’ Takes Shape
Quick Look
BudsLink Brings Advanced Earbud Controls to Linux Desktops
Quick Look
An empirical analysis of vulnerability detection tools for solidity smart contracts
Quick Look
HypeAssign: Hypergraph contrastive learning for issue assignment
Quick Look
Debian Experiments with AI-Assisted Bug Triage as Open-Source Projects Face Growing Report Overload
Quick Look
Is this build failure related to my patch? An empirical study of unrelated build failures in continuous integration
Quick Look
Alpine Linux Experiments with Systemd Compatibility While Keeping Its Lightweight Identity
Quick Look
Machine learning, deep learning, or large language models: An empirical study on multi-label requirements classification
Quick Look
CMF-Vul: Advancing automated vulnerability detection via contrastive multimodal fusion and challenge-driven representation learning
Quick Look
Effective Fine-tuning for Low-resource Languages: A Case Study of Cangjie
Quick Look
From brittle to robust: Improving LLM annotations for SE optimization
Quick Look
Understanding developer well-being: measuring mental health and productivity in software teams
Quick Look
Plug it and Play on Logs: A configuration-free statistic-based log parser
Quick Look
GNOME 51 Development Officially Begins as ‘A Coruña’ Cycle Gets Underway