Similar Items: RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation
- A Language for Describing Agentic LLM Contexts
- A Domain Incremental Continual Learning Benchmark for ICU Time Series Model Transportability
- SmartEval: A Benchmark for Evaluating LLM-Generated Smart Contracts from Natural Language Specifications
- Resilient nursing in ICU: Aadaptive practices beyond IPC protocols for MDRO management. A qualitative study
- LLM-enabled Social Agents
- Foresight Arena: An On-Chain Benchmark for Evaluating AI Forecasting Agents