RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation

Bibliographic Details
Published in: ArXiv cs.MA Recent Papers
Format: Online Article, RSS Article
Published: 2026
collection WordPress RSS
FRELIP Feed Integration
container_title ArXiv cs.MA Recent Papers
description
discipline_display Engineering & Technology
discipline_facet Engineering & Technology
format Online Article
RSS Article
genre Journal Article
id rss_article:51596
institution FRELIP
journal_source_facet ArXiv cs.MA Recent Papers
publishDate 2026
publishDateSort 2026
record_format rss_article
sub_discipline_display Petroleum & Energy
sub_discipline_facet Petroleum & Energy
subject_display ArXiv cs.MA Recent Papers
Petroleum & Energy
Engineering & Technology
subject_facet ArXiv cs.MA Recent Papers
Petroleum & Energy
Engineering & Technology
title RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation
topic ArXiv cs.MA Recent Papers
Petroleum & Energy
Engineering & Technology
url https://arxiv.org/abs/2605.13542v1