Prompt-injection threat model
The defining security problem of tool-using agents. When an agent ingests external content — a web page, a file — hidden instructions in that content can hij…
The defining security problem of tool-using agents. When an agent ingests external content — a web page, a file — hidden instructions in that content can hij…
The defining security problem of tool-using agents. When an agent ingests external
content — a web page, a file — hidden instructions in that content can hijack its
behaviour, and this cannot be fully defended with smarter prompts alone; isolating
the execution environment is the reliable defence. This topic connects directly to
the cognitive-overload attack in EB-1 Advanced.