Similar Items: Multi-scenario benchmark for autonomous driving systems: Exposing diverse behavioral anomalies
- A multi-language perspective on the robustness of LLM code generation
- Machine learning, deep learning, or large language models: An empirical study on multi-label requirements classification
- Identifying performance-sensitive configurations in software systems with LLM-based agents
- A recommendation system for predicting dependencies among software changes insights from an empirical study on OpenStack
- Echoes of AI: Investigating the downstream effects of AI assistants on software maintainability
- LibreOffice 26.4 Beta Experiments with AI Writing Features and Smarter Editing Tools