inference
inference — What's happening when the AI is actually answering you, as opposed to when it was being trained.
Training is how the model is made. Inference is what happens every time you use it after that. When people talk about cost, speed, or running a model on a laptop, they usually mean inference.
"Inference is expensive." "Run inference locally."
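For readers who like to see the split in code, here is a minimal sketch. It is a toy, not how real AI models are built: the function names `train` and `infer`, the single-number "model", and the example data are all made up for illustration. It only shows the shape of the idea, that training runs once to produce the model and inference reuses that result on each new input.

```python
# Toy illustration only: the "model" here is a single number (a weight).
# train() and infer() are hypothetical names chosen for this sketch.

def train(examples):
    """Training: learn one weight w so that y is roughly w * x (least squares)."""
    numerator = sum(x * y for x, y in examples)
    denominator = sum(x * x for x, _ in examples)
    return numerator / denominator

def infer(weight, x):
    """Inference: apply the already-learned weight to a new input."""
    return weight * x

# Training happens once, up front, on example (input, output) pairs.
weight = train([(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)])

# Inference happens every time the model is used afterwards.
predictions = [infer(weight, x) for x in (4.0, 10.0)]
print(predictions)
```

Nothing is learned during the `infer` calls; they only reuse what `train` produced. In this sketch that is why inference can be repeated as often as needed, and why its cost adds up per use rather than being paid once.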