Model · Updated 2026-04
Mixture of Experts (MoE)
Definition
MoE is a model architecture that activates only a fraction of its parameters for each request, making large models more efficient.
See also in the glossary
L
LLM (Large Language Model)
An LLM is an AI model trained on billions of texts, capable of understanding and generating human language.
T
Transformer
The Transformer is the neural network architecture powering nearly all modern LLMs, introduced by Google researchers in the 2017 paper "Attention Is All You Need".
A
AI Inference
Inference is the process of using a trained AI model to generate predictions or responses from new data.
F
Foundation Model
A foundation model is a large AI model pre-trained on massive data, adaptable to multiple tasks.
Frequently Asked Questions
How does MoE work?
The model contains multiple specialized 'experts', typically feed-forward sub-networks. A learned router decides which experts to activate for each token. As a result, a model with 1T total parameters might activate only around 100B of them for any given request, keeping compute costs far below those of a dense model of the same size.
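The routing step described above can be sketched in a few lines. This is a minimal illustration, not any model's actual implementation: the expert count, hidden dimension, and top-k value are arbitrary assumptions, and the weights are random rather than learned.

```python
# Toy sketch of MoE top-k routing. All sizes below are illustrative
# assumptions, not taken from any real model.
import numpy as np

rng = np.random.default_rng(0)

NUM_EXPERTS = 8   # total experts in the layer
TOP_K = 2         # experts activated per token
D_MODEL = 16      # hidden dimension

# Each "expert" is stood in for by a small weight matrix.
experts = [rng.normal(size=(D_MODEL, D_MODEL)) for _ in range(NUM_EXPERTS)]
# The router is a linear map from a token vector to per-expert scores.
router_w = rng.normal(size=(D_MODEL, NUM_EXPERTS))

def moe_forward(x: np.ndarray) -> np.ndarray:
    """Route one token vector x through its top-k experts only."""
    scores = x @ router_w                 # one logit per expert
    top = np.argsort(scores)[-TOP_K:]     # indices of the best TOP_K experts
    weights = np.exp(scores[top])
    weights /= weights.sum()              # softmax over the chosen experts
    # Only TOP_K of the NUM_EXPERTS experts do any work for this token.
    return sum(w * (x @ experts[i]) for w, i in zip(weights, top))

token = rng.normal(size=D_MODEL)
out = moe_forward(token)
print(out.shape)  # → (16,)
```

The key point is that the per-token cost scales with TOP_K, not with NUM_EXPERTS, which is why total parameter count can grow far faster than inference cost.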
Which models use MoE?
GPT-4 (rumored, never confirmed by OpenAI), Mistral's Mixtral (confirmed), Google's Gemini, and DeepSeek-V3 all use MoE. It has become the dominant architecture for very large models.