Channels - RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation :: FRELIP Discovery

Similar Items: RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation

Quick Look
A Language for Describing Agentic LLM Contexts
Quick Look
A Domain Incremental Continual Learning Benchmark for ICU Time Series Model Transportability
Quick Look
SmartEval: A Benchmark for Evaluating LLM-Generated Smart Contracts from Natural Language Specifications
Quick Look
Resilient nursing in ICU: Aadaptive practices beyond IPC protocols for MDRO management. A qualitative study
Quick Look
LLM-enabled Social Agents
Quick Look
Foresight Arena: An On-Chain Benchmark for Evaluating AI Forecasting Agents
Quick Look
EnactToM: An Evolving Benchmark for Functional Theory of Mind in Embodied Agents
Quick Look
Agent Island: A Saturation- and Contamination-Resistant Benchmark from Multiagent Games
Quick Look
When Does Hierarchy Help? Benchmarking Agent Coordination in Event-Driven Industrial Scheduling
Quick Look
Safe Multi-Agent Behavior Must Be Maintained, Not Merely Asserted: Constraint Drift in LLM-Based Multi-Agent Systems
Quick Look
Pythia: Toward Predictability-Driven Agent-Native LLM Serving
Quick Look
Coordination as an Architectural Layer for LLM-Based Multi-Agent Systems
Quick Look
Deterministic vs. LLM-Controlled Orchestration for COBOL-to-Python Modernization
Quick Look
HBEE: Human Behavioral Entropy Engine -- Pre-Registered Multi-Agent LLM Simulation of Peer-Suspicion-Based Detection Inversion
Quick Look
LLM-Foraging: Large Language Models for Decentralized Swarm Robot Foraging
Quick Look
AgenticPrecoding: LLM-Empowered Multi-Agent System for Precoding Optimization
Quick Look
PIVOT: Bridging Planning and Execution in LLM Agents via Trajectory Refinement
Quick Look
An interpretable deep learning framework for predictive modeling of postoperative infections in ICU patients
Quick Look
Beyond the Black Box: Interpretability of Agentic AI Tool Use
Quick Look
Nothing Deceives Like Success: Social Learning and the Illusion of Understanding in Science
Quick Look
The $textit{Silicon Society}$ Cookbook: Design Space of LLM-based Social Simulations
Quick Look
Governed Collaborative Memory as Artificial Selection in LLM-Based Multi-Agent Systems
Quick Look
Active Learning for Communication Structure Optimization in LLM-Based Multi-Agent Systems
Quick Look
The Memory Curse: How Expanded Recall Erodes Cooperative Intent in LLM Agents