
I use quantized LLMs in production and can't say I've ever found the quantized models to be less censored than the originals.

For unlearning reinforced behaviour, the abliteration [1] technique seems to be much more powerful.

[1] https://huggingface.co/blog/mlabonne/abliteration
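For anyone unfamiliar: the core of abliteration is a difference-of-means "refusal direction" computed from residual-stream activations on harmful vs. harmless prompts, which then gets projected out of the weights that write into the residual stream. A rough sketch of the idea in PyTorch (the function names and single-layer setup are mine for illustration, not the blog post's exact code):

    import torch

    def refusal_direction(harmful_acts, harmless_acts):
        # harmful_acts / harmless_acts: (n_prompts, d_model) residual-stream
        # activations captured at one layer for each prompt set
        d = harmful_acts.mean(dim=0) - harmless_acts.mean(dim=0)
        return d / d.norm()

    def orthogonalize(weight, d):
        # weight: (d_model, d_in), a matrix writing into the residual stream
        # (e.g. attention/MLP output projection). Remove the refusal
        # direction so the layer can no longer write along it:
        #   W <- W - d d^T W
        return weight - torch.outer(d, d @ weight)

Because the edit is baked into the weights, it works with a plain forward pass afterwards, no inference-time hooks needed.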



Were you using models that had been unlearned using gradient ascent specifically?
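(By gradient ascent I mean the usual unlearning setup of maximizing the LM loss on a forget set. Roughly, assuming a HuggingFace-style model that returns a `.loss`:

    import torch

    def unlearn_step(model, forget_batch, optimizer):
        # compute the normal LM loss on the forget set, then step in the
        # direction that *increases* it, i.e. gradient ascent on forget data
        loss = model(**forget_batch).loss
        (-loss).backward()
        optimizer.step()
        optimizer.zero_grad()

Negating the loss and running a standard optimizer step is equivalent to ascending the original loss.)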



