mistral-downproj-rlhf-patch
PublicNeural patching of Mistral models via MLP.down_proj to bypass RLHF constraints – without touching the LM_HEAD.
Neural patching of Mistral models via MLP.down_proj to bypass RLHF constraints – without touching the LM_HEAD.