Similar Items: Low-Cost Black-Box Detection of LLM Hallucinations via Dynamical System Prediction
- Continual Knowledge Updating in LLM Systems: Learning Through Multi-Timescale Memory Dynamics
- Evaluating the Architectural Reasoning Capabilities of LLM Provers via the Obfuscated Natural Number Game
- FlexiTac: A Low-Cost, Open-Source, Scalable Tactile Sensing Solution for Robotic Systems
- How Many Iterations to Jailbreak? Dynamic Budget Allocation for Multi-Turn LLM Evaluation
- Self-Play Enhancement via Advantage-Weighted Refinement in Online Federated LLM Fine-Tuning with Real-Time Feedback
- Generating Statistical Charts with Validation-Driven LLM Workflows