RAIN: Your Language Models Can Align Themselves without Finetuning - Microsoft Research 2023 - Reduces the adversarial prompt attack success rate from 94% to 19%! AI
Paper: https://arxiv.org/abs/2309.07124...
Paper: https://arxiv.org/abs/2309.07124...