Developmental approach reveals the statistical learning of Neural Language Models: Transformers generalize from the most abstract statistical patterns
This study investigates the statistical learning and mental representation of neural language models by training Generative Transformer models on a synthetic grammar and analyzing their internal representations at various stages.