Updated 2026-04
Attention Mechanism
Definition
The attention mechanism allows a model to weigh the importance of each token relative to every other token in the sequence, capturing global context.
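The weighting described above can be sketched as scaled dot-product attention, the formulation used in the Transformer paper. This is a minimal NumPy illustration, not any library's implementation; the function names are my own.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax: subtract the row max before exponentiating.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(Q, K, V):
    # Scaled dot-product attention: each query attends to every key,
    # so every position is weighted against all others (global context).
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)    # (n, n) pairwise similarities
    weights = softmax(scores, axis=-1) # each row sums to 1
    return weights @ V, weights

rng = np.random.default_rng(0)
n, d = 4, 8                      # 4 tokens, 8-dimensional embeddings
X = rng.normal(size=(n, d))
out, w = attention(X, X, X)      # self-attention: Q = K = V = X
```

Each row of `w` is a probability distribution over all tokens, which is exactly the "weighing the importance of each token" in the definition.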
See also in the glossary
Transformer
The Transformer is the neural network architecture behind virtually all modern LLMs, introduced by Google researchers in 2017.
LLM (Large Language Model)
An LLM is an AI model trained on vast amounts of text, capable of understanding and generating human language.
Deep Learning
Deep Learning is a subset of Machine Learning using multi-layered neural networks to learn complex representations from raw data.
Context Window
The context window is the maximum amount of text, measured in tokens, that an LLM can process in a single request.
Frequently Asked Questions
What does 'Attention is All You Need' mean?
It's the title of Google's 2017 paper that introduced the Transformer. It showed that the attention mechanism alone was sufficient, without recurrent networks.
Does attention have a cost?
Yes. Standard attention has quadratic cost in sequence length: doubling the text length quadruples the computation. That's why context windows have limits.
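The quadratic cost comes from the attention score matrix, which holds one entry per (query, key) pair. A small NumPy sketch (my own illustration, with an assumed embedding size of 64) makes the scaling concrete:

```python
import numpy as np

def score_matrix_entries(n, d=64):
    # Build the n x n attention score matrix for a random n-token sequence.
    # Its size grows as n^2, which is where the quadratic cost comes from.
    X = np.random.default_rng(0).normal(size=(n, d))
    scores = X @ X.T / np.sqrt(d)  # shape (n, n)
    return scores.size

for n in (256, 512, 1024):
    print(n, score_matrix_entries(n))
```

Doubling `n` from 256 to 512 quadruples the entry count, matching the statement above.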