Similar Items: Models Recall What They Violate: Constraint Adherence in Multi-Turn LLM Ideation
- Synthetic Users, Real Differences: an Evaluation Framework for User Simulation in Multi-Turn Conversations
- MCJudgeBench: A Benchmark for Constraint-Level Judge Evaluation in Multi-Constraint Instruction Following
- Rose-SQL: Role-State Evolution Guided Structured Reasoning for Multi-Turn Text-to-SQL
- From Unstructured Recall to Schema-Grounded Memory: Reliable AI Memory via Iterative, Schema-Aware Extraction
- ReLay: Personalized LLM-Generated Plain-Language Summaries for Better Understanding, but at What Cost?
- Logical Consistency as a Bridge: Improving LLM Hallucination Detection via Label Constraint Modeling between Responses and Self-Judgments