Modèle Updated 2026-04

Transformer

Transformer Architecture

Definition

The Transformer is the neural network architecture powering all modern LLMs, invented by Google in 2017.

Tools that use transformer

ChatGPT

The world's most used conversational AI assistant

4.6/5

Claude

The AI that understands nuance, by Anthropic

4.7/5

Gemini

Google's AI assistant with 1M token context

4.5/5

DeepSeek

The open source Chinese model rivaling GPT-4

4.7/5

Frequently Asked Questions

Why did the Transformer revolutionize AI?

Thanks to the attention mechanism that processes all words in parallel (not sequentially). This captures long-range relationships in text and enables massive scaling.

Do all LLMs use Transformers?

Yes, in 2026 all major LLMs (GPT, Claude, Gemini, Llama, Mistral) are Transformer-based. Alternatives exist (Mamba, RWKV) but remain niche.

See also in the glossary

Tools that use transformer

Frequently Asked Questions