DiARC: Distinguishing Positive and Negative Samples Helps Improving ARC-like Reasoning Ability of Large Language Models
The paper introduces DiARC, a method that improves the abstract reasoning capabilities of large language models by incorporating negative sample supervision alongside positive examples. This approach addresses the limitations of current methods that rely heavily on data augmentation or expensive closed-source models.