Passer au contenu principal
Publication

Attention with Markov: A Curious Case of Single-layer Transformers