Search Results - (evolution OR evaluation)

  1. Synthetic Users, Real Differences: an Evaluation Framework for User Simulation in Multi-Turn Conversations

    Published in ArXiv cs.CL Recent Papers (2026)
  2. Dependency Parsing Across the Resource Spectrum: Evaluating Architectures on High and Low-Resource Languages

    Published in ArXiv cs.CL Recent Papers (2026)
  3. DoGMaTiQ: Automated Generation of Question-and-Answer Nuggets for Report Evaluation

    Published in ArXiv cs.IR Recent Papers (2026)
  4. How Many Iterations to Jailbreak? Dynamic Budget Allocation for Multi-Turn LLM Evaluation

    Published in ArXiv cs.LG Recent Papers (2026)
  5. Reality Check: How Avatar and Face Representation Affect the Perceptual Evaluation of Synthesized Gestures

    Published in ArXiv cs.HC Recent Papers (2026)
  6. When the Ruler is Broken: Parsing-Induced Suppression in LLM-Based Security Log Evaluation

    Published in ArXiv cs.CR Recent Papers (2026)
  7. LLARS: Enabling Domain Expert & Developer Collaboration for LLM Prompting, Generation and Evaluation

    Published in ArXiv cs.HC Recent Papers (2026)
  8. WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

    Published in ArXiv cs.CL Recent Papers (2026)
  9. SkillSafetyBench: Evaluating Agent Safety under Skill-Facing Attack Surfaces

    Published in ArXiv cs.MA Recent Papers (2026)
  10. Evaluation of primocane-fruiting raspberry (Rubus idaeus L.) cultivars in Estonian climatic conditions

    Published in Rural Sustainability Research (2025)
  11. Detection and Quantitative Evaluation of Internal Bubble Defects in Basin Insulators Using Infrared Thermography

    Published in IEEE Access (2026)
  12. Performance Evaluation of Linux-Based Parallel Redundancy Protocol (PRP) for Redundant Industrial Networks

    Published in IEEE Access (2026)
  13. X-Means Clustering for UX Evaluation of the Candy CBT Application Using the SUS Instrument

  14. Comparison of the performance of a Three-Dimensional Body Scanner and radiography in evaluating adult scoliosis

    Published in PeerJ (2026)
  15. Antiviral and anti-inflammatory evaluation of herbal extracts: Implications for the management of calf diarrheal diseases

    Published in PLOS ONE (2026)
  16. Risk prediction models for sepsis-associated encephalopathy: a systematic evaluation and meta-analysis

    Published in PeerJ (2026)
  17. Implementation and evaluation of the Forro stream cipher in Tofino programmable hardware for remote attestation in datacenters

  18. SHIELD: System for Harmful Explicit-Content Identification and Evaluation Through LLM-Driven Approach

    Published in IEEE Access (2026)