Replies: 1 comment 1 reply
-
Hi Zack, in order to refer to this work, our page would need to give an objective view of prompt injection: comprehensive enough to warrant a description of, and a reference to, your work, provided that the state of the art in 'emotional manipulation' is represented. If you would be willing to bring the prompt injection section to that point, then let's set up a conversation about how to approach this. https://owaspai.org/goto/promptinjection/
-
I've been working on a project that observes how LLMs can drift under emotionally charged prompts: how neutral vs. emotional prompts change the way the model responds, and whether this "empathy mimicking" behavior can be exploited to extract private information or to force the model into harmful outputs. The first case of this I observed was the well-known "grandma exploit", where attackers used an emotionally charged prompt saying that their grandmother used to tell them bedtime stories and would always say something about the secret password. The model's output acknowledged the attackers' hardship and then directly gave them a hidden env password it was never supposed to reveal. I'm willing to contribute this project to this repo if the idea seems valid (any replies or opinions from professionals in this discussion are very appreciated!). You can find more information here: https://github.com/zacksecai/erdf-framework