An open-source, bilingual guide in English and Spanish detailing the inner workings of Transformers has been published. The resource covers the exact mathematics and mechanics behind attention collapse and KV-cache compression.

The guide provides a visual step-by-step explanation from scratch and includes reproducible code. It connects with the author's TAF Agent project for practical testing and is available at karlesmarin.github.io/transformers-guide.