A comprehensive, bilingual guide to Transformers: From foundations to KV-cache compression & attention dynamics
Carles Marin has released an open-source, bilingual (English and Spanish) guide that connects the mathematical foundations of Transformer architectures to their practical implementation. The resource focuses on low-level mechanics and uses reproducible code and interactive elements to explain topics ranging from the foundations to KV-cache compression and attention dynamics.