Aiconomy

Tensor Core

Specialized processing units within NVIDIA GPUs designed specifically for the matrix multiplication operations that dominate AI computation, delivering massive performance gains for AI workloads.

Tensor cores were introduced with NVIDIA's Volta architecture in 2017. The H100 contains up to 528 tensor cores that perform mixed-precision matrix multiply-accumulate operations at very high throughput, and the Blackwell B200 adds FP4 tensor core operations, doubling throughput for inference workloads.

For AI workloads, tensor cores can process matrix multiplications 8-16x faster than standard CUDA cores. They support multiple precision formats (FP64, TF32, FP16, INT8, FP4), allowing developers to trade precision for speed. Tensor cores are the key hardware innovation that made training trillion-parameter models feasible.
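The central idea behind mixed precision, low-precision inputs combined with higher-precision accumulation, can be sketched in plain NumPy. This is a conceptual illustration of the arithmetic a tensor core performs (FP16 inputs, FP32 accumulation), not actual GPU code; the function name is ours, and real tensor cores operate on small fixed-size tiles in hardware:

```python
import numpy as np

def mixed_precision_matmul(a, b):
    """Emulate tensor-core-style arithmetic: quantize inputs to FP16,
    then accumulate products in FP32 (the hardware accumulates in
    higher precision to limit rounding error)."""
    a16 = a.astype(np.float16)  # inputs stored at reduced precision
    b16 = b.astype(np.float16)
    # Upcast before the product so accumulation happens in FP32.
    return a16.astype(np.float32) @ b16.astype(np.float32)

rng = np.random.default_rng(0)
a = rng.standard_normal((64, 64)).astype(np.float32)
b = rng.standard_normal((64, 64)).astype(np.float32)

full = a @ b                       # full-FP32 reference
mixed = mixed_precision_matmul(a, b)
err = np.max(np.abs(full - mixed))  # small but nonzero quantization error
```

The output stays FP32; only the inputs lose precision. This is the trade the text describes: for many AI workloads the small quantization error is acceptable in exchange for the large speedup that narrower input formats enable.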
