Channels - FlowEval: Reference-based Evaluation of Generated User Interfaces :: FRELIP Discovery

Similar Items: FlowEval: Reference-based Evaluation of Generated User Interfaces

Quick Look
MemFlow: Intent-Driven Memory Orchestration for Small Language Model Agents
Quick Look
Retrieval-Conditioned Topology Selection with Provable Budget Conservation for Multi-Agent Code Generation
Quick Look
RoadMapper: A Multi-Agent System for Roadmap Generation of Solving Complex Research Problems
Quick Look
Foresight Arena: An On-Chain Benchmark for Evaluating AI Forecasting Agents
Quick Look
Coordination Matters: Evaluation of Cooperative Multi-Agent Reinforcement Learning
Quick Look
SOTOPIA-TOM: Evaluating Information Management in Multi-Agent Interaction with Theory of Mind
Quick Look
SWE-WebDevBench: Evaluating Coding Agent Application Platforms as Virtual Software Agencies
Quick Look
Where Did It Go Wrong? Capability-Oriented Failure Attribution for Vision-and-Language Navigation Agents
Quick Look
A Survey of Multi-Agent Deep Reinforcement Learning with Graph Neural Network-Based Communication
Quick Look
Volitional Multiagent Atomic Transactions: Describing People and their Machines
Quick Look
Should I Replan? Learning to Spot the Right Time in Robust MAPF Execution
Quick Look
Pythia: Toward Predictability-Driven Agent-Native LLM Serving
Quick Look
Operating-Layer Controls for Onchain Language-Model Agents Under Real Capital
Quick Look
I Would If I Could: Reasoning about Dynamics of Actions in Multi-Agent Systems
Quick Look
When Agents Shop for You: Role Coherence in AI-Mediated Markets
Quick Look
Agent Name Service (ANS): A Proof-of-Concept Trust Layer for Secure AI Agent Discovery, Identity, and Governance in Kubernetes
Quick Look
Split over $n$ resource sharing problem: Are fewer capable agents better than many simpler ones?
Quick Look
Preserving Disagreement: Architectural Heterogeneity and Coherence Validation in Multi-Agent Policy Simulation
Quick Look
AGEL-Comp: A Neuro-Symbolic Framework for Compositional Generalization in Interactive Agents
Quick Look
Bian Que: An Agentic Framework with Flexible Skill Arrangement for Online System Operations
Quick Look
A High-Throughput Compute-Efficient POMDP Hide-And-Seek-Engine (HASE) for Multi-Agent Operations
Quick Look
Nothing Deceives Like Success: Social Learning and the Illusion of Understanding in Science
Quick Look
Reinforced Agent: Inference-Time Feedback for Tool-Calling Agents
Quick Look
When Roles Fail: Epistemic Constraints on Advocate Role Fidelity in LLM-Based Political Statement Analysis