Modèle Updated 2026-04

Transformer

Transformer Architecture
Definition

The Transformer is the neural network architecture powering all modern LLMs, invented by Google in 2017.

Frequently Asked Questions

Why did the Transformer revolutionize AI?
Thanks to the attention mechanism that processes all words in parallel (not sequentially). This captures long-range relationships in text and enables massive scaling.
Do all LLMs use Transformers?
Yes, in 2026 all major LLMs (GPT, Claude, Gemini, Llama, Mistral) are Transformer-based. Alternatives exist (Mamba, RWKV) but remain niche.