Similar Items: Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty?
- The Evaluation Differential: When Frontier AI Models Recognise They Are Being Tested
- Assessing the Creativity of Large Language Models: Testing, Limits, and New Frontiers
- AuraMask: An Extensible Pipeline for Developing Aesthetic Anti-Facial Recognition Image Filters
- OpenWatch: A Multimodal Benchmark for Hand Gesture Recognition on Smartwatches
- From Model Uncertainty to Human Attention: Localization-Aware Visual Cues for Scalable Annotation Review
- How Frontier LLMs Adapt to Neurodivergence Context: A Measurement Framework for Surface vs. Structural Change in System-Prompted Responses