Similar Items: Code-Centric Detection of Vulnerability-Fixing Commits: A Unified Benchmark and Empirical Study
- How Code Representation Shapes False-Positive Dynamics in Cross-Language LLM Vulnerability Detection
- MOSAIC-Bench: Measuring Compositional Vulnerability Induction in Coding Agents
- Security Incentivization: An Empirical Study of how Micropayments Impact Code Security
- Agentic Vulnerability Reasoning on Windows COM Binaries
- Generating Proof-of-Vulnerability Tests to Help Enhance the Security of Complex Software
- LITMUS: Benchmarking Behavioral Jailbreaks of LLM Agents in Real OS Environments