Technique · Updated 2026-04

AI Inference

Definition

Inference is the process of using a trained AI model to generate predictions or responses from new data.

Frequently Asked Questions

What's the difference between training and inference?
Training creates the model (expensive, done up front). Inference runs the trained model to answer each request (far cheaper per request). When you ask ChatGPT a question, that's inference.
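The split can be sketched with a toy model: a one-time "training" step that fits a single weight, and a cheap "inference" function that reuses it for each new input. All numbers and function names here are invented for illustration.

```python
# Toy illustration: "training" fits parameters once; "inference" reuses them per request.

def train(examples):
    """One-time, expensive step: fit a slope w for y = w * x by least squares."""
    num = sum(x * y for x, y in examples)
    den = sum(x * x for x, _ in examples)
    return num / den  # the "model" is just one learned weight

def infer(w, x):
    """Cheap, per-request step: apply the frozen model to new input."""
    return w * x

model = train([(1, 2), (2, 4), (3, 6)])  # learns w = 2.0
print(infer(model, 10))                  # prediction for unseen input: 20.0
```

Real models have billions of weights instead of one, but the shape is the same: training is paid once, inference is paid on every request.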
Why does inference cost money?
Each request runs through the model on GPUs, so cost scales with the size of the model and with the number of tokens processed and generated. That's why APIs charge per token.
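A minimal sketch of per-token billing, using invented rates (these are not any real API's prices; input tokens are typically billed cheaper than output tokens):

```python
# Hypothetical pricing for illustration only.
PRICE_PER_INPUT_TOKEN = 0.000003   # e.g. $3 per million input tokens (assumed)
PRICE_PER_OUTPUT_TOKEN = 0.000015  # e.g. $15 per million output tokens (assumed)

def request_cost(input_tokens, output_tokens):
    """Cost grows with how much the model must read and generate."""
    return (input_tokens * PRICE_PER_INPUT_TOKEN
            + output_tokens * PRICE_PER_OUTPUT_TOKEN)

# Same prompt, different response lengths: the longer answer costs more.
print(request_cost(500, 100))   # short answer
print(request_cost(500, 2000))  # long answer
```

Output tokens dominate the bill for long responses, which is why the length of the response matters as much as the length of the prompt.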