Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Just pick up any >240GB VRAM GPU off your local BestBuy to run a quantized version.

> The full Kimi K2.5 model is 630GB and typically requires at least 4× H200 GPUs.



You could run the full, unquantized model at high speed with 8 RTX 6000 Blackwell boards.

I don't see a way to put together a decent system of that scale for less than $100K, given RAM and SSD prices. A system with 4x H200s would cost more like $200K.


That would be quite the space heater, too!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: