Hardware
Local AI hardware, compared
Every realistic way to run AI models locally, on three axes that actually decide the build: memory bandwidth, memory capacity, and what it costs you today, new and used. GPUs, Apple Silicon, unified-memory mini-PCs, and datacenter accelerators in one chart.
Prices updated 2026-06-04 · indicative US market figures · 59 devices
The mental model, borrowed from Ahmad Osman: local AI is capacity × bandwidth × software stack. Capacity tells you what fits, bandwidth tells you how hard the box can breathe during decode, and the stack decides how much of the spec sheet you actually cash out. Don't ask "which hardware is best?" Ask "which bottleneck am I buying?"
How to read this chart
Two questions decide a local-AI box: does the model fit, and how much speed do you get for the money. So memory runs up the side (higher = more fits) and bandwidth per dollar runs across (right = more speed for your money), with bubble size showing raw bandwidth. Up and to the right is the goal. Cheap gaming cards cluster bottom-right, great value but they run out of memory; big-memory Macs and datacenter cards sit higher yet drift left, more room but slow per dollar or priced for a server rack. Lucebox (amber) is the box you plug in at home that holds the upper right: 128 GB of unified headroom with real GPU bandwidth, tuned, for a fixed price. Tip: toggle Datacenter off to see only what you can actually buy. Click Lucebox for the build.
up = more memory (what fits) · right = more bandwidth per dollar (speed for the money) · bubble = raw bandwidth · both axes log · tap Lucebox for the build
Every device
Sort any column, or tick rows to compare a few head to head. Value is capability per dollar (memory × bandwidth ÷ price), so a box with lots of fast memory for the money scores high. Price is the one that matters: new while a part is still sold, used once it is discontinued. Apple, mini-PC and Lucebox memory is the full unified pool.
| Device ⇅ | Class ⇅ | Memory GB ⇅ | Bandwidth GB/s ⇅ | Price ⇅ | Value ⇅ | |
|---|---|---|---|---|---|---|
| ★ Luceboxdetails → RTX 3090 24GB + 128GB unified, tuned lucebox-hub stack | Lucebox | 152 u | 936 | $4,900 new | 29 | |
| MI400 / MI455X 432GB HBM4, ships H2 2026 (estimate) | Datacenter | 432 | 19,600 | $50,000 used | 169 | |
| B200 192GB | Datacenter | 192 | 8,000 | $45,000 new | 34 | |
| B300 288GB Blackwell Ultra | Datacenter | 288 | 8,000 | $42,000 new | 55 | |
| MI325X 256GB | Datacenter | 256 | 6,000 | $20,000 new | 77 | |
| MI300X 192GB | Datacenter | 192 | 5,300 | $18,000 new | 57 | |
| H200 141GB | Datacenter | 141 | 4,800 | $33,000 new | 21 | |
| H100 80GB SXM | Datacenter | 80 | 3,350 | $32,000 new | 8 | |
| A100 80GB | Datacenter | 80 | 2,039 | $7,000 used | 23 | |
| RTX 5090 32GB Street price well above $1999 MSRP (GDDR7 supply) | Consumer GPU | 32 | 1,792 | $4,199 new | 14 | |
| RTX PRO 6000 Blackwell 96GB 96GB workstation, 1792 GB/s | Workstation | 96 | 1,792 | $8,800 new | 20 | |
| RTX PRO 5000 Blackwell 72GB | Workstation | 72 | 1,344 | $4,900 new | 20 | |
| AMD Instinct MI50 32GB Cheap-VRAM favorite, used only | Datacenter | 32 | 1,024 | $320 used | 102 | |
| RTX 3090 Ti 24GB | Consumer GPU | 24 | 1,008 | $1,300 used | 19 | |
| RTX 4090 24GB | Consumer GPU | 24 | 1,008 | $2,300 used | 11 | |
| RTX 5080 16GB | Consumer GPU | 16 | 960 | $1,249 new | 12 | |
| RX 7900 XTX 24GB | Consumer GPU | 24 | 960 | $810 used | 28 | |
| RTX 6000 Ada 48GB | Workstation | 48 | 960 | $6,800 new | 7 | |
| RTX 3090 24GB 24GB used-market favorite for local AI | Consumer GPU | 24 | 936 | $1,050 used | 21 | |
| RTX 3080 12GB | Consumer GPU | 12 | 912 | $340 used | 32 | |
| Intel Arc Pro B60 Dual 48GB Two B60 GPUs on one card | Workstation | 48 | 912 | $1,200 new | 36 | |
| Tesla V100 32GB | Datacenter | 32 | 900 | $750 used | 38 | |
| RTX 5070 Ti 16GB | Consumer GPU | 16 | 896 | $979 new | 15 | |
| Radeon PRO W7900 48GB | Workstation | 48 | 864 | $3,400 new | 12 | |
| Radeon PRO W7800 48GB Gigabyte AI TOP 48GB variant | Workstation | 48 | 864 | $2,700 new | 15 | |
| L40S 48GB | Datacenter | 48 | 864 | $8,500 new | 5 | |
| Mac Studio M3 Ultra 96GB | Apple | 96 u | 819 | $3,999 new | 20 | |
| Mac Studio M3 Ultra 512GB 512GB pulled, scarce under the RAM shortage | Apple | 512 u | 819 | $30,000 used | 14 | |
| RX 7900 XT 20GB | Consumer GPU | 20 | 800 | $578 used | 28 | |
| Mac Studio M2 Ultra 192GB | Apple | 192 u | 800 | $5,500 used | 28 | |
| RTX A6000 48GB | Workstation | 48 | 768 | $3,800 used | 10 | |
| RTX 3080 10GB | Consumer GPU | 10 | 760 | $300 used | 25 | |
| RTX 4080 Super 16GB | Consumer GPU | 16 | 736 | $870 used | 14 | |
| RTX 4080 16GB | Consumer GPU | 16 | 717 | $850 used | 13 | |
| RTX 4070 Ti Super 16GB | Consumer GPU | 16 | 672 | $750 used | 14 | |
| RTX 5070 12GB | Consumer GPU | 12 | 672 | $635 new | 13 | |
| RX 9070 XT 16GB | Consumer GPU | 16 | 640 | $650 new | 16 | |
| RX 9070 16GB | Consumer GPU | 16 | 640 | $580 new | 18 | |
| Radeon AI PRO R9700 32GB | Workstation | 32 | 640 | $1,299 new | 16 | |
| RX 7800 XT 16GB | Consumer GPU | 16 | 624 | $420 used | 24 | |
| RTX 2080 Ti 22GB (modded) China-modded 22GB refurb | Consumer GPU | 22 | 616 | $450 used | 30 | |
| MacBook Pro M5 Max 128GB | Apple | 128 u | 614 | $6,749 new | 12 | |
| Tenstorrent Wormhole n300 24GB RISC-V Tensix, fully OSS stack | Workstation | 24 | 576 | $1,399 new | 10 | |
| Mac Studio M4 Max 128GB 40-core; 128GB pulled, max now 96GB | Apple | 128 u | 546 | $3,200 used | 22 | |
| RX 6800 16GB | Consumer GPU | 16 | 512 | $340 used | 24 | |
| Tenstorrent Blackhole p150 32GB RISC-V Blackhole, fully OSS stack | Workstation | 32 | 512 | $1,399 new | 12 | |
| RTX 4070 12GB | Consumer GPU | 12 | 504 | $479 used | 13 | |
| Intel Arc Pro B60 24GB | Workstation | 24 | 456 | $599 new | 18 | |
| RTX 3060 Ti 8GB | Consumer GPU | 8 | 448 | $210 used | 17 | |
| Mac Studio M4 Max 36GB 32-core GPU, ~410 GB/s | Apple | 36 u | 410 | $1,999 new | 7 | |
| RTX 3060 12GB | Consumer GPU | 12 | 360 | $240 used | 18 | |
| Tesla P40 24GB Cheap 24GB, no video out, needs blower | Datacenter | 24 | 347 | $300 used | 28 | |
| MacBook Pro M5 Pro 64GB | Apple | 64 u | 307 | $3,499 new | 6 | |
| Mac mini M4 Pro 64GB 64GB config pulled (DRAM shortage) | Apple | 64 u | 273 | $2,200 used | 8 | |
| DGX Spark 128GB (GB10) Coherent memory + CUDA | Mini-PC | 128 u | 273 | $4,699 new | 7 | |
| Ryzen AI Max+ 395 128GB Strix Halo, ~96GB allocatable to GPU | Mini-PC | 128 u | 256 | $3,500 new | 9 | |
| Jetson AGX Orin 64GB 275 TOPS edge dev kit | Mini-PC | 64 u | 204 | $1,999 new | 7 | |
| MacBook Air M5 32GB | Apple | 32 u | 153 | $1,499 new | 3 | |
| Mac mini M4 16GB | Apple | 16 u | 120 | $799 new | 2 |
Method. Memory bandwidth is peak theoretical. Each device shows the price that actually applies: the current new price while it is still sold, the current used price once it is discontinued. Figures are indicative US-market numbers triangulated across retailers (Newegg, Amazon, B&H, Apple) and used marketplaces (eBay sold listings, resellers) as of 2026-06-04; the 2026 memory shortage has pushed new prices up and pulled several high-memory configs from sale. Value = memory × bandwidth ÷ price (capability per dollar). Datacenter parts are quote-based and sold in 8-GPU boards, so their per-unit figures are approximate. "u" marks unified memory. Numbers move fast; treat them as a snapshot, not a quote.
Our build Lucebox Full disclosure, we make this one. An RTX 3090 (24GB, 936 GB/s) with 128GB unified memory and a tuned stack, up to 207 tok/s, plug-and-play and warrantied. Not the cheapest $/GB on the board, that is the point: it is the build done, tuned and warrantied, where stock setups trail by several times. Good value for the money, and the numbers above are here so you can judge for yourself. $4,900 See full details →