A Case Research with the StrongREJECT Benchmark – The Berkeley Synthetic Intelligence Analysis Weblog

After we started learning jailbreak evaluations, we discovered an interesting paper claiming that you may jailbreak…