Hacker News

With regard to language models/transformers, the neural engine/NPU is still potentially useful for the prompt pre-processing (prefill) step, which is generally compute-bound. Token generation, by contrast, needs memory bandwidth, so GPU compute with neural/tensor accelerators is preferable there.
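To make the compute-vs-bandwidth split concrete, here's a rough roofline-style sketch. All hardware and model numbers are illustrative assumptions, not measurements of any specific chip:

```python
# Back-of-envelope estimate for a 7B-parameter model in fp16.
# Every number below is an assumed, illustrative figure.

params = 7e9
bytes_per_param = 2                        # fp16
weight_bytes = params * bytes_per_param    # ~14 GB of weights

mem_bw = 400e9     # assumed memory bandwidth, bytes/s
compute = 100e12   # assumed accelerator throughput, FLOP/s

# Decode: each new token re-reads all weights once -> bandwidth-bound.
decode_time = weight_bytes / mem_bw        # seconds per generated token
decode_tokens_per_s = 1 / decode_time

# Prefill: an N-token prompt is one large batched matmul,
# roughly 2 FLOPs per parameter per token -> compute-bound for large N.
n_prompt = 1024
prefill_flops = 2 * params * n_prompt
prefill_time = prefill_flops / compute     # seconds for the whole prompt

print(f"decode: ~{decode_tokens_per_s:.0f} tokens/s (bandwidth-limited)")
print(f"prefill of {n_prompt} tokens: ~{prefill_time:.2f} s (compute-limited)")
```

Under these assumed numbers decode speed scales with memory bandwidth and barely touches the FLOP budget, while prefill does the reverse, which is why a compute-heavy NPU can still earn its keep on the prefill side.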


I think I'd still rather have that hardware area put into tensor cores for the GPU than into this unit, which is only programmable via ONNX.


