Chain-Of-Thought Manipulation

From Cognitive Attack Taxonomy

Chain-Of-Thought Manipulation

Short Description: The attacker "walks" a model through a series of benign prompts to shift the context.

CAT ID: CAT-2022-317

Layer: 7

Operational Scale: Tactical

Level of Maturity: Well-Established

Category: TTP

Subcategory:

Also Known As:

Description:

Brief Description:

Closely Related Concepts:

Mechanism:

Multipliers:

Detailed Description: The attacker "walks" a model through a series of benign prompts to shift the context of a malicious prompt which will now appear benign in the new shifted context.

INTERACTIONS [VETs]:

Examples:

Use Case Example(s):

Example(s) From The Wild:

Comments:

References: