Hardware

Local AI hardware, compared

Every realistic way to run AI models locally, on three axes that actually decide the build: memory bandwidth, memory capacity, and what it costs you today, new and used. GPUs, Apple Silicon, unified-memory mini-PCs, and datacenter accelerators in one chart.

Prices updated 2026-06-04 · indicative US market figures · 59 devices

The mental model, borrowed from Ahmad Osman: local AI is capacity × bandwidth × software stack. Capacity tells you what fits, bandwidth tells you how hard the box can breathe during decode, and the stack decides how much of the spec sheet you actually cash out. Don't ask "which hardware is best?" Ask "which bottleneck am I buying?"

Capacity · GB
What fits. A model plus its KV cache has to live in memory. 24GB runs a tuned 27B, 128GB unified runs much larger, 512GB fits frontier-class.
Bandwidth · GB/s
How fast it generates. Decode is memory-bound, so tokens per second tracks bandwidth. GPUs stay the bandwidth kings; unified memory trades speed for size.
Software stack
What you can cash out. The same silicon at stock vs tuned can differ several-fold. Fitting is not serving. This chart shows the first two; the stack is on you.

How to read this chart

Two questions decide a local-AI box: does the model fit, and how much speed do you get for the money. So memory runs up the side (higher = more fits) and bandwidth per dollar runs across (right = more speed for your money), with bubble size showing raw bandwidth. Up and to the right is the goal. Cheap gaming cards cluster bottom-right, great value but they run out of memory; big-memory Macs and datacenter cards sit higher yet drift left, more room but slow per dollar or priced for a server rack. Lucebox (amber) is the box you plug in at home that holds the upper right: 128 GB of unified headroom with real GPU bandwidth, tuned, for a fixed price. Tip: toggle Datacenter off to see only what you can actually buy. Click Lucebox for the build.

Show

up = more memory (what fits)  ·  right = more bandwidth per dollar (speed for the money)  ·  bubble = raw bandwidth  ·  both axes log  ·  tap Lucebox for the build

Every device

Sort any column, or tick rows to compare a few head to head. Value is capability per dollar (memory × bandwidth ÷ price), so a box with lots of fast memory for the money scores high. Price is the one that matters: new while a part is still sold, used once it is discontinued. Apple, mini-PC and Lucebox memory is the full unified pool.

Device Class Memory GB Bandwidth GB/s Price Value
★ Luceboxdetails →
RTX 3090 24GB + 128GB unified, tuned lucebox-hub stack
Lucebox 152 u 936 $4,900 new 29
MI400 / MI455X 432GB
HBM4, ships H2 2026 (estimate)
Datacenter 432 19,600 $50,000 used 169
B200 192GB Datacenter 192 8,000 $45,000 new 34
B300 288GB
Blackwell Ultra
Datacenter 288 8,000 $42,000 new 55
MI325X 256GB Datacenter 256 6,000 $20,000 new 77
MI300X 192GB Datacenter 192 5,300 $18,000 new 57
H200 141GB Datacenter 141 4,800 $33,000 new 21
H100 80GB SXM Datacenter 80 3,350 $32,000 new 8
A100 80GB Datacenter 80 2,039 $7,000 used 23
RTX 5090 32GB
Street price well above $1999 MSRP (GDDR7 supply)
Consumer GPU 32 1,792 $4,199 new 14
RTX PRO 6000 Blackwell 96GB
96GB workstation, 1792 GB/s
Workstation 96 1,792 $8,800 new 20
RTX PRO 5000 Blackwell 72GB Workstation 72 1,344 $4,900 new 20
AMD Instinct MI50 32GB
Cheap-VRAM favorite, used only
Datacenter 32 1,024 $320 used 102
RTX 3090 Ti 24GB Consumer GPU 24 1,008 $1,300 used 19
RTX 4090 24GB Consumer GPU 24 1,008 $2,300 used 11
RTX 5080 16GB Consumer GPU 16 960 $1,249 new 12
RX 7900 XTX 24GB Consumer GPU 24 960 $810 used 28
RTX 6000 Ada 48GB Workstation 48 960 $6,800 new 7
RTX 3090 24GB
24GB used-market favorite for local AI
Consumer GPU 24 936 $1,050 used 21
RTX 3080 12GB Consumer GPU 12 912 $340 used 32
Intel Arc Pro B60 Dual 48GB
Two B60 GPUs on one card
Workstation 48 912 $1,200 new 36
Tesla V100 32GB Datacenter 32 900 $750 used 38
RTX 5070 Ti 16GB Consumer GPU 16 896 $979 new 15
Radeon PRO W7900 48GB Workstation 48 864 $3,400 new 12
Radeon PRO W7800 48GB
Gigabyte AI TOP 48GB variant
Workstation 48 864 $2,700 new 15
L40S 48GB Datacenter 48 864 $8,500 new 5
Mac Studio M3 Ultra 96GB Apple 96 u 819 $3,999 new 20
Mac Studio M3 Ultra 512GB
512GB pulled, scarce under the RAM shortage
Apple 512 u 819 $30,000 used 14
RX 7900 XT 20GB Consumer GPU 20 800 $578 used 28
Mac Studio M2 Ultra 192GB Apple 192 u 800 $5,500 used 28
RTX A6000 48GB Workstation 48 768 $3,800 used 10
RTX 3080 10GB Consumer GPU 10 760 $300 used 25
RTX 4080 Super 16GB Consumer GPU 16 736 $870 used 14
RTX 4080 16GB Consumer GPU 16 717 $850 used 13
RTX 4070 Ti Super 16GB Consumer GPU 16 672 $750 used 14
RTX 5070 12GB Consumer GPU 12 672 $635 new 13
RX 9070 XT 16GB Consumer GPU 16 640 $650 new 16
RX 9070 16GB Consumer GPU 16 640 $580 new 18
Radeon AI PRO R9700 32GB Workstation 32 640 $1,299 new 16
RX 7800 XT 16GB Consumer GPU 16 624 $420 used 24
RTX 2080 Ti 22GB (modded)
China-modded 22GB refurb
Consumer GPU 22 616 $450 used 30
MacBook Pro M5 Max 128GB Apple 128 u 614 $6,749 new 12
Tenstorrent Wormhole n300 24GB
RISC-V Tensix, fully OSS stack
Workstation 24 576 $1,399 new 10
Mac Studio M4 Max 128GB
40-core; 128GB pulled, max now 96GB
Apple 128 u 546 $3,200 used 22
RX 6800 16GB Consumer GPU 16 512 $340 used 24
Tenstorrent Blackhole p150 32GB
RISC-V Blackhole, fully OSS stack
Workstation 32 512 $1,399 new 12
RTX 4070 12GB Consumer GPU 12 504 $479 used 13
Intel Arc Pro B60 24GB Workstation 24 456 $599 new 18
RTX 3060 Ti 8GB Consumer GPU 8 448 $210 used 17
Mac Studio M4 Max 36GB
32-core GPU, ~410 GB/s
Apple 36 u 410 $1,999 new 7
RTX 3060 12GB Consumer GPU 12 360 $240 used 18
Tesla P40 24GB
Cheap 24GB, no video out, needs blower
Datacenter 24 347 $300 used 28
MacBook Pro M5 Pro 64GB Apple 64 u 307 $3,499 new 6
Mac mini M4 Pro 64GB
64GB config pulled (DRAM shortage)
Apple 64 u 273 $2,200 used 8
DGX Spark 128GB (GB10)
Coherent memory + CUDA
Mini-PC 128 u 273 $4,699 new 7
Ryzen AI Max+ 395 128GB
Strix Halo, ~96GB allocatable to GPU
Mini-PC 128 u 256 $3,500 new 9
Jetson AGX Orin 64GB
275 TOPS edge dev kit
Mini-PC 64 u 204 $1,999 new 7
MacBook Air M5 32GB Apple 32 u 153 $1,499 new 3
Mac mini M4 16GB Apple 16 u 120 $799 new 2

Method. Memory bandwidth is peak theoretical. Each device shows the price that actually applies: the current new price while it is still sold, the current used price once it is discontinued. Figures are indicative US-market numbers triangulated across retailers (Newegg, Amazon, B&H, Apple) and used marketplaces (eBay sold listings, resellers) as of 2026-06-04; the 2026 memory shortage has pushed new prices up and pulled several high-memory configs from sale. Value = memory × bandwidth ÷ price (capability per dollar). Datacenter parts are quote-based and sold in 8-GPU boards, so their per-unit figures are approximate. "u" marks unified memory. Numbers move fast; treat them as a snapshot, not a quote.

Our build Lucebox Full disclosure, we make this one. An RTX 3090 (24GB, 936 GB/s) with 128GB unified memory and a tuned stack, up to 207 tok/s, plug-and-play and warrantied. Not the cheapest $/GB on the board, that is the point: it is the build done, tuned and warrantied, where stock setups trail by several times. Good value for the money, and the numbers above are here so you can judge for yourself. $4,900 See full details →