Search Results - (evolution OR evaluation)

  1. Evaluating the scotopic visual sensitivity of walleye (Sander vitreus) and implications for foraging habitat

    Published in PeerJ (2026)
  2. Evaluation of the Photocatalytic Degradation of Acetamiprid and its Ecotoxicological Impacts using ZnO

    Published in Water, Air, & Soil Pollution (2026)
  3. SynSQL: Synthesizing Relational Databases for Robust Evaluation of Text-to-SQL Systems

    Published in ArXiv cs.DB Recent Papers (2026)
  4. FinSafetyBench: Evaluating LLM Safety in Real-World Financial Scenarios

    Published in ArXiv cs.CL Recent Papers (2026)
  5. SOTOPIA-TOM: Evaluating Information Management in Multi-Agent Interaction with Theory of Mind

    Published in ArXiv cs.MA Recent Papers (2026)
  6. Evaluating Different Modalities of Behavioral Approach Tests for Spider Phobia in Virtual Reality

    Published in ArXiv cs.HC Recent Papers (2026)
  7. Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems

    Published in ArXiv cs.IR Recent Papers (2026)
  8. Deployment-Relevant Alignment Cannot Be Inferred from Model-Level Evaluation Alone

    Published in ArXiv cs.HC Recent Papers (2026)
  9. AgentTrust: Runtime Safety Evaluation and Interception for AI Agent Tool Use

    Published in ArXiv cs.CR Recent Papers (2026)
  10. UX in the Age of AI: Rethinking Evaluation Metrics Through a Statistical Lens

    Published in ArXiv cs.HC Recent Papers (2026)
  11. Cited but Not Verified: Parsing and Evaluating Source Attribution in LLM Deep Research Agents

    Published in ArXiv cs.CL Recent Papers (2026)
  12. Duplicate-Aware Shift-and-Lift Carleman Linearization: Structure, Complexity, and Comparative Evaluation

    Published in ArXiv cs.CE Recent Papers (2026)
  13. MedVIGIL: Evaluating Trustworthy Medical VLMs Under Broken Visual Evidence

    Published in ArXiv cs.CV Recent Papers (2026)
  14. CalBench: Evaluating Coordination-Privacy Trade-offs in Multi-Agent LLMs

    Published in ArXiv cs.MA Recent Papers (2026)
  15. Evolutionary-Algorithm-Based Automatic Prompt Generation for Vision-Language Model Evaluation

    Published in IEEE Access (2025)
  16. Threat Modelling using Domain-Adapted Language Models: Empirical Evaluation and Insights

    Published in ArXiv cs.CR Recent Papers (2026)
  17. MaD Physics: Evaluating information seeking under constraints in physical environments

    Published in ArXiv cs.AI Recent Papers (2026)
  18. Battling biofilms: evaluating selected agents against Cutibacterium acnes—a review

    Published in PeerJ (2026)
  19. In-vitro evaluation of probiotic potential of gut microbes isolated from retail chicken

    Published in PLOS ONE (2026)
  20. Evaluation of the antibacterial activity of the natural product α-mangostin against Clostridioides difficile

    Published in PLOS ONE (2026)