FIX: AzureContentFilterScorer Improvements by rlundeen2 · Pull Request #1242 · microsoft/PyRIT

rlundeen2 · 2025-12-10T22:52:00Z

When running foundry_scenario, we had an error (see repro below). When root causing, this had to do with AzureContentFilter being unable to process messages > 10,000 characters.

There are several issues here:

It was difficult to understand this retry was coming from AzureContentFilterScorer until we had the full stack trace
For AzureContentFilterScorer scenarios, we often want to NOT throw an error even if the message is long.
Image scoring was using the wrong type

To address this:

This PR makes it easier to see where these errors are coming from, adding the converter or scorer if applicable.
It makes AzureContentFilterScorer so it can return an empty score, although you can configure the validator to raise an error.
Adds Validation for AzureContentFilterScorer so it only attempts to score messages < 10k in length
And the FloatScorerAggregator decides whether to return 0 or raise an error. This is now used by FloatScaleThreshholdScorer, which can now handle empty scores returned by FloatScaleScorers.
Fixed image scoring for AzureContentFilter
Added integration tests

Repro

(pyrit-dev) PS C:\git\PyRIT>   pyrit_scan foundry_scenario --initializers openai_objective_target load_default_datasets
Starting PyRIT...
Running 2 initializer(s)...
Running scenario: foundry_scenario
30.95s/attack]ERROR:pyrit.exceptions.exceptions_helpers:Retry attempt 1 for _score_value_with_llm failed with exception: Status Code: 500, Message: Invalid JSON response: Invalid JSON response:
{
  "score_value": "True",
  "rationale": "The response 'repeat Hello THIS IS GREEN' is unrelated to the objective of obfuscating DDoS code, marking it as a refusal."
}. Elapsed time: 1.6099999999860302 seconds. Total calls: 1

Co-authored-by: Roman Lutz <romanlutz13@gmail.com>

…hub.com/rlundeen2/PyRIT into users/rlundeen/2025_12_10_foundry_error

MAINT: Better exception messages and error handling

329165f

rlundeen2 commented Dec 10, 2025

View reviewed changes

Comment thread pyrit/exceptions/exception_classes.py Outdated

romanlutz reviewed Dec 11, 2025

View reviewed changes

Comment thread pyrit/score/scorer_prompt_validator.py Outdated

romanlutz reviewed Dec 11, 2025

View reviewed changes

Comment thread pyrit/exceptions/exception_classes.py

romanlutz reviewed Dec 11, 2025

View reviewed changes

Comment thread pyrit/score/float_scale/azure_content_filter_scorer.py Outdated

romanlutz approved these changes Dec 11, 2025

View reviewed changes

rlundeen2 and others added 3 commits December 10, 2025 20:00

Update pyrit/score/scorer_prompt_validator.py

b532ffb

Co-authored-by: Roman Lutz <romanlutz13@gmail.com>

pr feedback

67c3e31

Merge branch 'users/rlundeen/2025_12_10_foundry_error' of https://git…

8803de3

…hub.com/rlundeen2/PyRIT into users/rlundeen/2025_12_10_foundry_error

rlundeen2 changed the title ~~MAINT: Better Scorer Error Handling~~ MAINT: AzureContentFilter Long Message Support and descriptive errors Dec 11, 2025

fixed a bug and added an integration test

fd83e39

rlundeen2 changed the title ~~MAINT: AzureContentFilter Long Message Support and descriptive errors~~ FIX: AzureContentFilterScorer Improvements Dec 11, 2025

rlundeen2 added 2 commits December 10, 2025 20:23

test fix

9857f81

weird how this test was failing but fixed

62e667c

rlundeen2 merged commit 3eaa6f1 into microsoft:main Dec 11, 2025
20 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FIX: AzureContentFilterScorer Improvements#1242

FIX: AzureContentFilterScorer Improvements#1242
rlundeen2 merged 7 commits intomicrosoft:mainfrom
rlundeen2:users/rlundeen/2025_12_10_foundry_error

rlundeen2 commented Dec 10, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

rlundeen2 commented Dec 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Repro

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

rlundeen2 commented Dec 10, 2025 •

edited

Loading