
Improve prompt injection for Python #21641

Open
josefs wants to merge 5 commits into main from josefs/promptInjectionImprovements

Conversation


@josefs josefs commented Apr 2, 2026

I have a few repos where I'd like the prompt injection query to trigger, and I've verified that it at least finds new sources for these.

For more info on these repos, see:
https://github.com/dsp-testing/xpi-000

@github-actions github-actions Bot added the Python label Apr 2, 2026
@josefs josefs requested review from mbaluda and yoff April 2, 2026 16:47
Comment thread on python/ql/src/experimental/semmle/python/frameworks/OpenAI.qll (marked Fixed)
@josefs josefs added the no-change-note-required This PR does not need a change note label Apr 2, 2026
mbaluda (Contributor) left a comment

Please add test cases for the Anthropic models

@@ -20,7 +20,7 @@ async def get_input_openai():

     response2 = client.responses.create(
         instructions="Talks like a " + persona, # $ Alert[py/prompt-injection]
-        input=[
+        input=[ # $ Alert[py/prompt-injection]
Contributor reply:

Originally, the idea was to avoid duplicate alerts like this one (the flow is already reported for content); that is why we have that logic in getContentNode().
Can you add a test if that is not sufficient?
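For readers outside the thread, the duplicate-alert scenario being discussed can be sketched roughly as follows (hypothetical helper, not code from the PR): tainted data reaches both the "content" value inside a message dict and the enclosing list passed as input=, so flagging both would report the same flow twice.

```python
# Hypothetical sketch (not from the PR) of the duplicate-alert scenario:
# the tainted string is a sink candidate on its own, and the list that
# wraps it is a sink candidate too. Reporting both would flag one tainted
# flow twice, which is why the sink logic singles out the content node.

def build_input(persona):
    content = "Talks like a " + persona  # tainted string (inner candidate)
    return [{"role": "user", "content": content}]  # same taint via the outer list
```

A query that reports both the list argument and the content value inside it would produce two alerts for the single tainted `persona` flow.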

yoff previously approved these changes Apr 7, 2026
yoff (Contributor) left a comment

LGTM so far. I assume you will take it out of draft when you want a final review.

@josefs josefs force-pushed the josefs/promptInjectionImprovements branch from 0208d67 to 25a8aa9 on April 28, 2026 17:25
josefs (Author) commented Apr 28, 2026

Apologies for letting this PR linger.
I've removed the code that changed the prompt injection query. I deemed it too complicated for too little benefit.
I've also added tests for the Anthropic models.

@josefs josefs marked this pull request as ready for review April 28, 2026 21:31
@josefs josefs requested a review from a team as a code owner April 28, 2026 21:31
Copilot AI review requested due to automatic review settings April 28, 2026 21:31
Copilot AI left a comment

Pull request overview

This PR extends Python prompt-injection modeling and tests to cover additional LLM SDK call patterns (OpenAI responses + chat.completions, and Anthropic messages APIs), ensuring the query flags user-controlled data flowing into these prompt construction sinks.

Changes:

  • Added new OpenAI prompt-injection sinks for chat.completions.create(messages[].content) and responses.create(input/instructions).
  • Introduced Anthropic prompt-injection sink modeling (system prompts + message content) plus corresponding type modeling.
  • Expanded the CWE-1427 PromptInjection query test suite and updated expected results accordingly.
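The Anthropic test cases themselves are not shown in this thread. A rough sketch of what one might look like follows (hypothetical function and model names; in the CodeQL test suite the SDK is stubbed, and the `$ Alert` comments are inline-test annotations marking expected results, not executable code):

```python
# Hypothetical sketch of an Anthropic prompt-injection test case; the
# actual tests live in anthropic_test.py. The "$ Alert" comments mark
# where the py/prompt-injection query is expected to report.

def anthropic_sinks(client, untrusted):
    # untrusted: user-controlled data (e.g. from input() or a web request);
    # client would be anthropic.Anthropic() in the real test file.
    return client.messages.create(
        model="claude-3-5-sonnet-latest",  # hypothetical model name
        max_tokens=100,
        system="Talks like a " + untrusted,  # $ Alert[py/prompt-injection]
        messages=[
            {"role": "user", "content": untrusted},  # $ Alert[py/prompt-injection]
        ],
    )
```

Both the system prompt and the per-message content are modeled as sinks, matching the two sink kinds listed in the change summary.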
Summary per file:

  • python/ql/test/experimental/query-tests/Security/CWE-1427-PromptInjection/openai_test.py: Adds an additional alert annotation to validate responses.create(input=[...]) modeling.
  • python/ql/test/experimental/query-tests/Security/CWE-1427-PromptInjection/anthropic_test.py: New test coverage for Anthropic SDK prompt sinks (system, messages[].content) across sync/async/beta APIs.
  • python/ql/test/experimental/query-tests/Security/CWE-1427-PromptInjection/PromptInjection.expected: Updates expected results to include the new Anthropic/OpenAI sink findings and paths.
  • python/ql/lib/semmle/python/frameworks/openai.model.yml: Adds OpenAI sink models for chat completions message content and responses API inputs/instructions.
  • python/ql/lib/semmle/python/frameworks/anthropic.model.yml: New Anthropic sink and type models to support prompt-injection detection.
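The model files are not quoted in the thread. As a purely hypothetical sketch of what a Models-as-Data sink entry in anthropic.model.yml might look like (the exact type and access-path syntax should be checked against the actual file in the PR):

```yaml
# Hypothetical sketch of a Models-as-Data sink entry; consult the real
# anthropic.model.yml in this PR for the exact type and path syntax.
extensions:
  - addsTo:
      pack: codeql/python-all
      extensible: sinkModel
    data:
      # system prompt and message content flowing into messages.create
      - ["anthropic", "Member[Anthropic].Call.Member[messages].Member[create].Argument[system:]", "prompt-injection"]
      - ["anthropic", "Member[Anthropic].Call.Member[messages].Member[create].Argument[messages:].ListElement.DictionaryElement[content]", "prompt-injection"]
```

Modeling sinks as data extensions rather than QL classes keeps the framework coverage declarative and lets the query pick up the new sinks without code changes.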

Copilot's findings

  • Files reviewed: 5/5 changed files
  • Comments generated: 0


Labels

no-change-note-required (This PR does not need a change note), Python


5 participants