The 2,500 questions that make up the exam are specifically designed to probe the outer limits of what today’s AI systems ...
TII Racing set the fastest autonomous lap of the Championship, establishing a new benchmark for high-speed, vision-based ...
A massive new study comparing more than 100,000 people with today’s most advanced AI systems delivers a surprising result: ...
Anthropic has been forced to redesign its engineering hiring test multiple times. The AI startup had to make this change ...
AI now beats average humans on a creativity test, but the most creative people still outperform every AI model tested.
Introduction: Despite digital advances in healthcare, clinical neuropsychology has been slow to adopt automated assessment tools. Automated scoring of the Rey-Osterrieth Complex Figure Test (ROCFT) ...
We introduce a new benchmark, MoToMQA, to assess human and LLM ToM abilities at increasing orders. MoToMQA is based upon the format of the Imposing Memory Task (IMT), a well-validated psychological ...
Automated assessment of human motion plays a vital role in rehabilitation, enabling objective evaluation of patient performance and progress. Unlike general human activity recognition, rehabilitation ...
Abstract: The increasing labor shortage and aging population underline the need for assistive robots to support human care recipients. To enable safe and responsive assistance, robots require accurate ...
Researchers have developed the first scientifically validated "personality test" framework for popular AI chatbots, and have shown that chatbots not only mimic human personality traits, but their ...
This is read by an automated voice. Please report any issues or inconsistencies here. Readers challenge whether SAT scores predict success, citing a high scorer rejected by 16 universities who later ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results