BehaviorBench: Benchmarking Foundation Models for Behavioral Science Tasks
The authors introduce BehaviorBench, a benchmark designed to evaluate foundation models across diverse behavioral science tasks and populations. The study assesses four core capabilities (behavior prediction, strategic decision-making, subject-trait inference, and behavioral knowledge application) at both the individual and the distributional level.
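
To make the distinction between individual-level and distribution-level evaluation concrete, here is a minimal sketch. It is not the authors' scoring code. The two metrics (exact-match accuracy and total variation distance), the function names, and the toy data are all assumptions chosen for illustration, since the summary does not say which metrics BehaviorBench uses.

```python
from collections import Counter


def individual_accuracy(predicted, observed):
    """Fraction of subjects whose predicted choice equals their observed choice.

    Assumed individual-level metric; items are paired by subject.
    """
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have the same length")
    if not observed:
        raise ValueError("need at least one subject")
    matches = sum(p == o for p, o in zip(predicted, observed))
    return matches / len(observed)


def total_variation_distance(predicted, observed):
    """Half the L1 distance between the two empirical choice distributions.

    Assumed distribution-level metric; subject pairing is ignored.
    0.0 means identical distributions, 1.0 means disjoint support.
    """
    if not predicted or not observed:
        raise ValueError("both samples must be non-empty")
    p_counts, o_counts = Counter(predicted), Counter(observed)
    options = set(p_counts) | set(o_counts)
    return 0.5 * sum(
        abs(p_counts[c] / len(predicted) - o_counts[c] / len(observed))
        for c in options
    )


# Hypothetical data: four subjects in a two-option game.
observed = ["cooperate", "defect", "cooperate", "defect"]
predicted = ["defect", "cooperate", "defect", "cooperate"]

print(individual_accuracy(predicted, observed))       # 0.0
print(total_variation_distance(predicted, observed))  # 0.0
```

In this toy case the model reproduces the population's 50/50 split exactly (distance 0.0) while mispredicting every single subject (accuracy 0.0), which is why a benchmark may want to report both levels separately.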