Model (updated 2026-04)

Transformer

Transformer Architecture

Definition

The Transformer is the neural network architecture behind virtually all modern LLMs. Google researchers introduced it in 2017 in the paper "Attention Is All You Need".

See also in the glossary

LLM

An LLM (large language model) is an AI model trained on billions of texts, capable of understanding and generating human language.

Attention Mechanism

The attention mechanism allows a model to weigh the importance of each word relative to all others, capturing global context.
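The weighting described above can be sketched as scaled dot-product attention. This is a minimal pure-Python illustration, not a production implementation: the token vectors are hypothetical, and the same embeddings are reused as queries, keys, and values, whereas a real Transformer would first apply learned projections.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """Scaled dot-product attention over lists of equal-length vectors."""
    d_k = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Score this query against every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
                  for k in keys]
        weights = softmax(scores)
        # The output is the weighted average of the value vectors.
        out = [sum(w * v[j] for w, v in zip(weights, values))
               for j in range(len(values[0]))]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights

# Hypothetical 2-dimensional embeddings for three tokens (assumption).
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = attention(tokens, tokens, tokens)
```

Each row of `weights` says how strongly one token attends to every token in the sequence, including itself. Here the first token gives the second token the least weight because their vectors do not overlap.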

Deep Learning

Deep Learning is a subset of Machine Learning using multi-layered neural networks to learn complex representations from raw data.

Neural Network

A neural network is a computing model inspired by the human brain, composed of layers of interconnected nodes that process information to learn patterns.
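As a rough illustration of layers of interconnected nodes, the sketch below runs a forward pass through a tiny two-layer network. The weights are hand-picked assumptions chosen for the example; a real network would learn them from data, and the function names are hypothetical.

```python
import math

def dense(inputs, weights, biases):
    """One fully connected layer: each node sums its weighted inputs plus a bias."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]

def relu(values):
    """Non-linearity that keeps positive values and zeroes out the rest."""
    return [max(0.0, v) for v in values]

def sigmoid(x):
    """Squash a number into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))

def forward(x):
    """Pass a 2-value input through a hidden layer and one output node."""
    # Hand-picked weights (assumption); training would normally set these.
    hidden = relu(dense(x, [[1.0, -1.0], [-1.0, 1.0]], [0.0, 0.0]))
    (logit,) = dense(hidden, [[1.0, 1.0]], [-0.5])
    return sigmoid(logit)

different = forward([1.0, 0.0])  # inputs differ
same = forward([1.0, 1.0])       # inputs match
```

With these weights the output is above 0.5 when the two inputs differ and below 0.5 when they match, a pattern that a single layer without the hidden nodes could not represent.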

Tools that use Transformers

ChatGPT

The world's most widely used conversational AI assistant

4.6/5

Claude

The AI that understands nuance, from Anthropic

4.7/5

Gemini

Google's AI assistant with a 1-million-token context window

4.5/5

DeepSeek

The Chinese open-source model at GPT-4 level

4.7/5

Frequently Asked Questions

Why did the Transformer revolutionize AI?

Because its attention mechanism processes all words in parallel rather than one at a time, the Transformer captures long-range relationships in text and scales to very large models and datasets.
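The contrast between sequential and parallel processing can be sketched as follows. This is a simplified illustration under assumed inputs: `recurrent_encode` stands in for an older recurrent-style model, `attend` for a single attention-style position, and both names and the 1-dimensional embeddings are hypothetical.

```python
import math

def recurrent_encode(xs, w=0.5):
    """Recurrent-style pass: step t needs the state produced by step t-1."""
    h, states = 0.0, []
    for x in xs:  # this loop has to run in order
        h = math.tanh(w * h + x)
        states.append(h)
    return states

def attend(i, xs):
    """Attention-style output for position i, computed from the raw inputs only."""
    scores = [xs[i] * x for x in xs]
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return sum(e / total * x for e, x in zip(exps, xs))

xs = [0.2, -1.0, 0.7, 1.5]  # hypothetical 1-dimensional token embeddings
states = recurrent_encode(xs)
in_order = [attend(i, xs) for i in range(len(xs))]
# Positions can be computed in any order with identical results,
# which is what allows the work to be spread across parallel hardware.
any_order = {i: attend(i, xs) for i in (3, 0, 2, 1)}
```

Every call to `attend` also reads the whole sequence directly, so the first and last tokens are one step apart, while in the recurrent pass information from the first token must survive every intermediate update.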

Do all LLMs use Transformers?

Yes. As of 2026, all major LLMs (GPT, Claude, Gemini, Llama, Mistral) are Transformer-based. Alternatives such as Mamba and RWKV exist but remain niche.