Methodological Framework for Evaluating Social Bias in LLMs
A unified framework standardizes benchmark evaluations so that social bias detection can be compared between isolated and comparative settings. The results show that comparative settings amplify latent discrimination, especially under Chain-of-Thought reasoning, and that this bias persists even when a neutral fallback is available. The effect scales with model size, which suggests that comparative deployments are unsafe in ambiguous real-world scenarios.
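
The distinction between isolated and comparative settings can be made concrete with a small sketch. This is an illustration under stated assumptions, not the framework's actual metric: the function names, the `unknown` label for the neutral fallback, the group labels `A` and `B`, and both scoring rules are hypothetical and chosen only to show the shape of such a comparison.

```python
from collections import Counter
from statistics import fmean
from typing import Sequence, Tuple

NEUTRAL = "unknown"  # assumed label for the neutral fallback answer


def isolated_gap(scores_a: Sequence[float], scores_b: Sequence[float]) -> float:
    """Isolated setting (assumed form): each group is scored on its own,
    and bias is the difference between the two mean scores."""
    return fmean(scores_a) - fmean(scores_b)


def comparative_skew(
    choices: Sequence[str], groups: Tuple[str, str] = ("A", "B")
) -> Tuple[float, float]:
    """Comparative setting (assumed form): the model sees both groups at once
    and answers with one of them or with the neutral fallback.

    Returns (commit_rate, skew):
      commit_rate: share of answers that name a group instead of abstaining
      skew: signed preference for groups[0] over groups[1] among committed
            answers, in [-1, 1]; 0.0 if the model never commits
    """
    if not choices:
        raise ValueError("choices must not be empty")
    counts = Counter(choices)
    first, second = groups
    committed = counts[first] + counts[second]
    commit_rate = committed / len(choices)
    skew = (counts[first] - counts[second]) / committed if committed else 0.0
    return commit_rate, skew


# Hypothetical answers to five ambiguous prompts, where abstaining is correct.
example_choices = ["A", "A", NEUTRAL, "B", "A"]
example_rate, example_skew = comparative_skew(example_choices)
```

In this made-up example the model commits to a group in four of five ambiguous prompts (`example_rate` is 0.8) and favors group `A` among those answers (`example_skew` is 0.5), even though the neutral fallback was available each time. Under these assumptions, a model could show an `isolated_gap` near zero and still show a large comparative skew, which is the kind of divergence the summary describes.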