For unlearning reinforced behaviour, the abliteration [1] technique seems to be much more powerful.
[1] https://huggingface.co/blog/mlabonne/abliteration
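
For anyone who hasn't read [1], the core idea as I understand it is to estimate a "refusal direction" as the difference between mean activations on two sets of prompts, then orthogonalize the weights that write into the residual stream against that direction. Here is a rough NumPy sketch on toy data. The function names, shapes, and random activations are my own assumptions for illustration, not code from the post, and a real run would collect activations from an actual model.

```python
import numpy as np

def refusal_direction(harmful_acts, harmless_acts):
    """Unit vector along the difference of the two mean activations."""
    diff = harmful_acts.mean(axis=0) - harmless_acts.mean(axis=0)
    return diff / np.linalg.norm(diff)

def ablate_direction(weight, direction):
    """Project `direction` out of the output space of `weight`.

    `weight` has shape (d_model, d_in) and writes into the residual
    stream, so the update is W' = W - r r^T W.
    """
    return weight - np.outer(direction, direction @ weight)

# Toy stand-ins for residual-stream activations (hypothetical data).
rng = np.random.default_rng(0)
d_model = 8
harmful = rng.normal(size=(16, d_model)) + 2.0 * np.eye(d_model)[0]
harmless = rng.normal(size=(16, d_model))

r = refusal_direction(harmful, harmless)
W = rng.normal(size=(d_model, d_model))
W_abl = ablate_direction(W, r)

# After ablation, no output of W_abl has a component along r.
x = rng.normal(size=d_model)
residual = r @ (W_abl @ x)
```

No retraining is involved in this step: it is a closed-form edit of the weights, which is part of why it can be applied cheaply to an already fine-tuned model.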