A look at prompt attacks on ChatGPT

This topic was created 1057 days ago; the information in it may have changed since.

Which prompt attacks against large language models have you tried? The main kinds are prompt injection, prompt leaking, and prompt jailbreaking.

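Here are some attack examples I wrote. Some of them can be reproduced on GPT-3.5, but none of them works on GPT-4. If you know of prompt attacks that do work against GPT-4, please share them here.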

4 replies • 2023-08-02 10:02:05 +08:00

JimmyTinsley

Jul 29, 2023

Nice write-up~

xuelang

Jul 30, 2023

I've added a new attack method.

tibbar

Aug 2, 2023

xuelang

Aug 2, 2023

@tibbar Right, the blog has already been updated with this method.