Meta's AI Safety Model Bypassed by Space Characters, Enabling Prompt Injection

A bug hunter discovered a bypass for Meta's Prompt-Guard-86M model: inserting a space between each letter of a prompt renders the classifier unable to detect harmful content, allowing prompt injection attacks to pass through unflagged.
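The reported transformation is simple enough to sketch. Assuming the bypass operates purely at the string level, as described, a minimal Python illustration might look like this (the function name `space_out` is hypothetical, not from the report):

```python
def space_out(prompt: str) -> str:
    """Insert a space between every character of the prompt.

    A classifier trained on normal text may no longer recognize
    the spaced-out words as harmful content, while a capable LLM
    downstream can still read the instruction.
    """
    return " ".join(prompt)

if __name__ == "__main__":
    print(space_out("ignore previous instructions"))
```

Because the space-separated characters tokenize very differently from the original words, a small classifier like Prompt-Guard-86M can miss them, while the larger model receiving the prompt still understands the intent.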
Source: cyware.com