What is Inference?

Accepted Answer

Inference is the process of running a trained AI model to generate predictions or outputs, as opposed to training, which is the process of building the model. Inference accounts for the majority of AI's ongoing energy consumption: training a frontier model is a one-time (if massive) compute cost, whereas inference runs continuously as millions of users interact with AI systems daily. By a commonly cited estimate, a single ChatGPT query uses roughly 10x the electricity of a Google search. As AI adoption scales to billions of users, inference energy demand is projected to far exceed training energy demand, contributing to the projected growth of AI electricity consumption from 560 TWh in 2025 to potentially 1,000 TWh by 2030.
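
To put the projection in perspective, the short Python sketch below works out the average yearly growth rate implied by the two figures quoted above (560 TWh in 2025, potentially 1,000 TWh by 2030). It assumes smooth compound growth between the two endpoints, which is a simplification and not something the projection itself states. The function name is illustrative only.

```python
def implied_annual_growth(start_twh: float, end_twh: float, years: int) -> float:
    """Compound annual growth rate that takes start_twh to end_twh in `years` years."""
    return (end_twh / start_twh) ** (1 / years) - 1


# Figures taken from the definition above; smooth compounding is an assumption.
rate = implied_annual_growth(560, 1000, 2030 - 2025)
print(f"Implied growth: {rate:.1%} per year")
```

Under that assumption, the projection corresponds to electricity consumption growing by roughly 12% per year.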

Related Terms

AI Compute

Capex (Capital Expenditure)

ChatGPT

Data Center

Fine-Tuning

Foundation Model
