Computer Atlas

GPU

Also known as: graphics processing unit, gpus

core beginner technology 2 min read · Updated 2026-06-07

A processor designed for massive data-parallel work — originally for rendering graphics, now also the workhorse of machine learning, simulation, and crypto.

Primary domain
Graphics & Media
Sub-category
Graphics Processing Units

In simple terms

A GPU (graphics processing unit) is a chip with thousands of small cores that all do the same operation on different pieces of data at the same time. It was designed for rendering pixels, where every pixel needs roughly the same maths. Today it’s the engine behind almost all modern machine learning.

More detail

CPUs and GPUs are both processors but optimised for opposite things:

PropertyCPUGPU
Number of cores~4–128Thousands
Per-core powerVery powerful, complexSimple, narrow
Branch handlingExcellent (branch prediction)Poor (whole groups stall)
Best atSequential, branchy workMassive data-parallel work
Memory modelCoherent caches, low latencyHigh-bandwidth, higher latency

GPUs execute in warps / wavefronts of 32–64 lanes; all lanes in a warp do the same instruction at the same time (SIMD/SIMT). This is exactly what dense matrix multiplication needs — which is exactly what neural-network training does.

Programmable APIs that exposed GPUs beyond graphics:

  • CUDA (Nvidia, 2007) — the de facto standard for ML.
  • OpenCL — cross-vendor, less popular.
  • ROCm (AMD), Metal (Apple), DirectX Compute (Microsoft).

In 2026, a top-end data-centre GPU has tens of billions of transistors, 80+ GB of HBM memory, and performs hundreds of teraflops to petaflops on AI workloads.

Why it matters

Without GPUs, modern AI would not exist at its current scale. Training a large language model on CPUs would take centuries.

Real-world examples

  • Nvidia H100 / B100 — the standard ML training GPUs.

  • Apple M-series GPUs handle the screen on a Mac and run on-device ML.

  • A gaming PC uses its GPU for 3D rendering primarily but increasingly also for upscaling (DLSS) and physics.

  • Modern data-centre GPUs now ship with HBM3 stacks delivering 3+ TB/s of memory bandwidth — orders of magnitude more than CPU memory, and the reason GPUs dominate AI training.

Common misconceptions

  • “GPUs are faster than CPUs.” Only at parallel work. A GPU is terrible at sequential code with lots of branches.
  • “GPUs are only for graphics.” That hasn’t been true for two decades; today most data-centre GPUs barely render anything.

Learn next

The architectural cousin: CPU. What GPUs are mostly used for now: neural network.

Neighborhood

A visual companion to the relationships above. Click any node to visit that topic.