Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
binary132
6 days ago
|
parent
|
context
|
favorite
| on:
The $100B megadeal between OpenAI and Nvidia is on...
I find the Q8 runs a bit more than twice as fast as gpt-120b since I don’t have to offload as many MoE layers, but is just about as capable if not better.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: