Hardware

Local AI hardware, compared

Name: Local AI hardware: memory bandwidth, VRAM and price comparison
Creator: Lucebox

Every realistic way to run AI models locally, on three axes that actually decide the build: memory bandwidth, memory capacity, and what it costs you today, new and used. GPUs, Apple Silicon, unified-memory mini-PCs, and datacenter accelerators in one chart.

Prices updated 2026-06-04 · indicative US market figures · 59 devices

The mental model, borrowed from Ahmad Osman: local AI is capacity × bandwidth × software stack. Capacity tells you what fits, bandwidth tells you how hard the box can breathe during decode, and the stack decides how much of the spec sheet you actually cash out. Don't ask "which hardware is best?" Ask "which bottleneck am I buying?"

Capacity · GB

What fits. A model plus its KV cache has to live in memory. 24GB runs a tuned 27B, 128GB unified runs much larger, 512GB fits frontier-class.

Bandwidth · GB/s

How fast it generates. Decode is memory-bound, so tokens per second tracks bandwidth. GPUs stay the bandwidth kings; unified memory trades speed for size.

Software stack

What you can cash out. The same silicon at stock vs tuned can differ several-fold. Fitting is not serving. This chart shows the first two; the stack is on you.

How to read this chart

Two questions decide a local-AI box: does the model fit, and how much speed do you get for the money. So memory runs up the side (higher = more fits) and bandwidth per dollar runs across (right = more speed for your money), with bubble size showing raw bandwidth. Up and to the right is the goal. Cheap gaming cards cluster bottom-right, great value but they run out of memory; big-memory Macs and datacenter cards sit higher yet drift left, more room but slow per dollar or priced for a server rack. Lucebox (amber) is the box you plug in at home that holds the upper right: 128 GB of unified headroom with real GPU bandwidth, tuned, for a fixed price. Tip: toggle Datacenter off to see only what you can actually buy. Click Lucebox for the build.

Show

up = more memory (what fits) · right = more bandwidth per dollar (speed for the money) · bubble = raw bandwidth · both axes log · tap Lucebox for the build

Every device

Sort any column, or tick rows to compare a few head to head. Value is capability per dollar (memory × bandwidth ÷ price), so a box with lots of fast memory for the money scores high. Price is the one that matters: new while a part is still sold, used once it is discontinued. Apple, mini-PC and Lucebox memory is the full unified pool.

	Device ⇅	Class ⇅	Memory GB ⇅	Bandwidth GB/s ⇅	Price ⇅	Value ⇅
	★ Luceboxdetails → RTX 3090 24GB + 128GB unified, tuned lucebox-hub stack	Lucebox	152 u	936	$4,900 new	29
	MI400 / MI455X 432GB HBM4, ships H2 2026 (estimate)	Datacenter	432	19,600	$50,000 used	169
	B200 192GB	Datacenter	192	8,000	$45,000 new	34
	B300 288GB Blackwell Ultra	Datacenter	288	8,000	$42,000 new	55
	MI325X 256GB	Datacenter	256	6,000	$20,000 new	77
	MI300X 192GB	Datacenter	192	5,300	$18,000 new	57
	H200 141GB	Datacenter	141	4,800	$33,000 new	21
	H100 80GB SXM	Datacenter	80	3,350	$32,000 new	8
	A100 80GB	Datacenter	80	2,039	$7,000 used	23
	RTX 5090 32GB Street price well above $1999 MSRP (GDDR7 supply)	Consumer GPU	32	1,792	$4,199 new	14
	RTX PRO 6000 Blackwell 96GB 96GB workstation, 1792 GB/s	Workstation	96	1,792	$8,800 new	20
	RTX PRO 5000 Blackwell 72GB	Workstation	72	1,344	$4,900 new	20
	AMD Instinct MI50 32GB Cheap-VRAM favorite, used only	Datacenter	32	1,024	$320 used	102
	RTX 3090 Ti 24GB	Consumer GPU	24	1,008	$1,300 used	19
	RTX 4090 24GB	Consumer GPU	24	1,008	$2,300 used	11
	RTX 5080 16GB	Consumer GPU	16	960	$1,249 new	12
	RX 7900 XTX 24GB	Consumer GPU	24	960	$810 used	28
	RTX 6000 Ada 48GB	Workstation	48	960	$6,800 new	7
	RTX 3090 24GB 24GB used-market favorite for local AI	Consumer GPU	24	936	$1,050 used	21
	RTX 3080 12GB	Consumer GPU	12	912	$340 used	32
	Intel Arc Pro B60 Dual 48GB Two B60 GPUs on one card	Workstation	48	912	$1,200 new	36
	Tesla V100 32GB	Datacenter	32	900	$750 used	38
	RTX 5070 Ti 16GB	Consumer GPU	16	896	$979 new	15
	Radeon PRO W7900 48GB	Workstation	48	864	$3,400 new	12
	Radeon PRO W7800 48GB Gigabyte AI TOP 48GB variant	Workstation	48	864	$2,700 new	15
	L40S 48GB	Datacenter	48	864	$8,500 new	5
	Mac Studio M3 Ultra 96GB	Apple	96 u	819	$3,999 new	20
	Mac Studio M3 Ultra 512GB 512GB pulled, scarce under the RAM shortage	Apple	512 u	819	$30,000 used	14
	RX 7900 XT 20GB	Consumer GPU	20	800	$578 used	28
	Mac Studio M2 Ultra 192GB	Apple	192 u	800	$5,500 used	28
	RTX A6000 48GB	Workstation	48	768	$3,800 used	10
	RTX 3080 10GB	Consumer GPU	10	760	$300 used	25
	RTX 4080 Super 16GB	Consumer GPU	16	736	$870 used	14
	RTX 4080 16GB	Consumer GPU	16	717	$850 used	13
	RTX 4070 Ti Super 16GB	Consumer GPU	16	672	$750 used	14
	RTX 5070 12GB	Consumer GPU	12	672	$635 new	13
	RX 9070 XT 16GB	Consumer GPU	16	640	$650 new	16
	RX 9070 16GB	Consumer GPU	16	640	$580 new	18
	Radeon AI PRO R9700 32GB	Workstation	32	640	$1,299 new	16
	RX 7800 XT 16GB	Consumer GPU	16	624	$420 used	24
	RTX 2080 Ti 22GB (modded) China-modded 22GB refurb	Consumer GPU	22	616	$450 used	30
	MacBook Pro M5 Max 128GB	Apple	128 u	614	$6,749 new	12
	Tenstorrent Wormhole n300 24GB RISC-V Tensix, fully OSS stack	Workstation	24	576	$1,399 new	10
	Mac Studio M4 Max 128GB 40-core; 128GB pulled, max now 96GB	Apple	128 u	546	$3,200 used	22
	RX 6800 16GB	Consumer GPU	16	512	$340 used	24
	Tenstorrent Blackhole p150 32GB RISC-V Blackhole, fully OSS stack	Workstation	32	512	$1,399 new	12
	RTX 4070 12GB	Consumer GPU	12	504	$479 used	13
	Intel Arc Pro B60 24GB	Workstation	24	456	$599 new	18
	RTX 3060 Ti 8GB	Consumer GPU	8	448	$210 used	17
	Mac Studio M4 Max 36GB 32-core GPU, ~410 GB/s	Apple	36 u	410	$1,999 new	7
	RTX 3060 12GB	Consumer GPU	12	360	$240 used	18
	Tesla P40 24GB Cheap 24GB, no video out, needs blower	Datacenter	24	347	$300 used	28
	MacBook Pro M5 Pro 64GB	Apple	64 u	307	$3,499 new	6
	Mac mini M4 Pro 64GB 64GB config pulled (DRAM shortage)	Apple	64 u	273	$2,200 used	8
	DGX Spark 128GB (GB10) Coherent memory + CUDA	Mini-PC	128 u	273	$4,699 new	7
	Ryzen AI Max+ 395 128GB Strix Halo, ~96GB allocatable to GPU	Mini-PC	128 u	256	$3,500 new	9
	Jetson AGX Orin 64GB 275 TOPS edge dev kit	Mini-PC	64 u	204	$1,999 new	7
	MacBook Air M5 32GB	Apple	32 u	153	$1,499 new	3
	Mac mini M4 16GB	Apple	16 u	120	$799 new	2
No devices match.

Method. Memory bandwidth is peak theoretical. Each device shows the price that actually applies: the current new price while it is still sold, the current used price once it is discontinued. Figures are indicative US-market numbers triangulated across retailers (Newegg, Amazon, B&H, Apple) and used marketplaces (eBay sold listings, resellers) as of 2026-06-04; the 2026 memory shortage has pushed new prices up and pulled several high-memory configs from sale. Value = memory × bandwidth ÷ price (capability per dollar). Datacenter parts are quote-based and sold in 8-GPU boards, so their per-unit figures are approximate. "u" marks unified memory. Numbers move fast; treat them as a snapshot, not a quote.

Our build Lucebox Full disclosure, we make this one. An RTX 3090 (24GB, 936 GB/s) with 128GB unified memory and a tuned stack, up to 207 tok/s, plug-and-play and warrantied. Not the cheapest $/GB on the board, that is the point: it is the build done, tuned and warrantied, where stock setups trail by several times. Good value for the money, and the numbers above are here so you can judge for yourself. $4,900 See full details →