Technique · Updated 2026-04
Test-time Compute
Definition
Test-time compute is the practice of allocating more computation at inference time to improve response quality, rather than scaling up the model itself.
See also in the glossary
AI Inference
Inference is the process of using a trained AI model to generate predictions or responses from new data.
AI Reasoning
AI reasoning refers to a model's ability to break down a problem into logical steps to reach a conclusion, rather than answering instinctively.
Chain of Thought
Chain of Thought is a prompting technique that asks the model to show its step-by-step reasoning before giving its final answer.
LLM (Large Language Model)
An LLM is an AI model trained on vast amounts of text, capable of understanding and generating human language.
Frequently Asked Questions
Why is test-time compute important?
Instead of making the model bigger (expensive to train), you make it think longer (expensive only when needed). That's the principle behind OpenAI o1 and DeepSeek R1.
How does it work?
The model generates multiple reasoning paths, evaluates them, and picks the best one. More compute time at inference generally means better answers.
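The sample-evaluate-pick loop above can be sketched in a few lines. This is a minimal, self-contained illustration of one common form of test-time compute (majority voting over repeated samples, as in self-consistency); the `sample_answer` function is a toy stand-in for what would be a real LLM call with temperature > 0, not an actual API.

```python
import random
from collections import Counter

# Hypothetical stand-in for an LLM call: a noisy solver that is right
# most of the time. In a real system this would sample the model.
def sample_answer(question: str, rng: random.Random) -> int:
    correct = 17 * 24  # pretend the model is solving this arithmetic question
    # 70% chance of the correct answer, otherwise a nearby wrong guess.
    if rng.random() < 0.7:
        return correct
    return correct + rng.choice([-10, -1, 1, 10])

def self_consistency(question: str, n: int, seed: int = 0) -> int:
    """Spend more test-time compute: sample n answers, return the majority."""
    rng = random.Random(seed)
    answers = [sample_answer(question, rng) for _ in range(n)]
    return Counter(answers).most_common(1)[0][0]

print(self_consistency("What is 17 * 24?", n=25))
```

With more samples (a larger `n`), the majority vote becomes more reliable than any single sample, which is the core trade: extra inference-time compute buys accuracy without retraining or enlarging the model.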