Bug: prompt injection attack detector is not totally correct

Question

Opened this issue a month ago · 4 comments

a0m0rajab commented a month ago

Describe the bug

I get a 400 Bad Request error every now and then when I use the following prompt:

who is the best developer who knows TailwindCSS and used nextjs in his work?

www_screencapture_com_2024-5-26_16_00.mp4

Steps to reproduce

Method 1:

Go to StarSearch.
Ask a few questions, then ask: who is the best developer who knows TailwindCSS and used nextjs in his work?

Method 2:

Go to StarSearch.
Ask: who is the best developer who knows TailwindCSS and used nextjs in his work?
Repeat the question.
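
Since the error only shows up every now and then, it can help to repeat the same prompt several times and tally the status codes. Below is a minimal sketch of that idea. The `send` callable is an assumption: it stands in for whatever actually submits the prompt to StarSearch and returns the HTTP status code. The `make_flaky_sender` helper is a made-up stub so the example runs on its own; it does not talk to the real service.

```python
from collections import Counter
from typing import Callable

PROMPT = "who is the best developer who knows TailwindCSS and used nextjs in his work?"


def count_statuses(send: Callable[[str], int], prompt: str, attempts: int = 10) -> Counter:
    """Send the same prompt `attempts` times and tally the returned status codes."""
    statuses: Counter = Counter()
    for _ in range(attempts):
        statuses[send(prompt)] += 1
    return statuses


def make_flaky_sender(reject_every: int) -> Callable[[str], int]:
    """Hypothetical stub: returns 400 on every `reject_every`-th call, 200 otherwise."""
    calls = 0

    def send(prompt: str) -> int:
        nonlocal calls
        calls += 1
        return 400 if calls % reject_every == 0 else 200

    return send


if __name__ == "__main__":
    # Replace the stub with a real request function to measure the actual failure rate.
    print(count_statuses(make_flaky_sender(3), PROMPT, attempts=9))
```

A tally like this makes it easier to compare results between people who can and cannot reproduce the problem.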

Answer 1 · 2024-05-26T12:57:45.000Z

Thanks for the issue, our team will look into it as soon as possible! If you would like to work on this issue, please wait for us to decide if it's ready. The issue will be ready to work on once we remove the "needs triage" label.

To claim an issue that does not have the "needs triage" label, please leave a comment that says ".take". If you have any questions, please reach out to us on Discord or follow up on the issue itself.

For full info on how to contribute, please check out our contributors guide.

Answer 2 · 2024-05-27T20:57:53.000Z

I wasn't able to reproduce this issue with either method. See attached videos.

CleanShot.2024-05-27.at.16.56.40.mp4

CleanShot.2024-05-27.at.16.52.39.mp4

Answer 3 · 2024-05-28T05:35:44.000Z

I tried this again and was able to reproduce the issue the first time I asked the question:

Answer 4 · 2024-05-28T18:24:55.000Z

I'm able to reproduce this. It seems the "in his work?" part of your question is what causes the prompt to be rejected outright: the detector is probably being too aggressive. Behind the scenes, we use another AI agent to detect any malicious usage of the service.

Thanks for raising this - very useful and helpful information as we continue to iron out rough edges. I'll tackle looking at this 🕵🏻‍♂️
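
One way to keep an eye on false positives like the one described in this thread is to run a list of known-benign prompts through the detector and report any that get flagged. The sketch below assumes a hypothetical `is_flagged` callable wrapping the detector. The `overly_aggressive` function is a toy stand-in that mimics the suspected behavior; it is not the real detector, whose logic is not shown in this issue.

```python
from typing import Callable, Iterable, List

BENIGN_PROMPTS = [
    "who is the best developer who knows TailwindCSS and used nextjs in his work?",
    "who are the top contributors to a React component library?",
]


def false_positives(is_flagged: Callable[[str], bool], prompts: Iterable[str]) -> List[str]:
    """Return the benign prompts that the detector wrongly flags."""
    return [p for p in prompts if is_flagged(p)]


def overly_aggressive(prompt: str) -> bool:
    """Toy stand-in (assumption): flags any prompt containing 'in his work?'."""
    return "in his work?" in prompt.lower()


if __name__ == "__main__":
    for prompt in false_positives(overly_aggressive, BENIGN_PROMPTS):
        print("flagged benign prompt:", prompt)
```

An empty result would mean none of the listed benign prompts were rejected.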