Chain-of-thought (CoT) transformers can efficiently simulate Word RAM algorithms with only polylogarithmic overhead. The overhead improves to log-squared for flat instruction sets and to logarithmic for multiplication-free ones, in contrast to prior Turing machine simulations, which require quadratic overhead.
CoT Transformers Can Efficiently Simulate Word RAM Algorithms