FEAT Breaking: Dataset Initializer and Scenario Datasets #1224

rlundeen2 · 2025-12-05T17:22:09Z

Description of Three changes

This should and could be three PRs. But in my first change, I ended up having to make several other changes.

First Change: Scenario Dataset Initialization

This is the problem I set out to tackle. But if we break it up this needs to go in last.

Added a scenario abstract method required_datasets for scenarios to define required default dataset names.
Updated the CLI to display required_datasets as part of the scenario info
Created load_default_datasets, a PyRItInitializer which loads all necessary datasets into memory using required_datasets and DatasetProvider
Updated scenarios to always load default datasets from memory (previous this was mostly yaml), which allows more flexibility in choosing.
Updated docs to often use required_datasets, since it's a really easy way to get started.

One problem: PyRItInitializer was synchronous, but we needed it to be async to use `DatasetProvider` and load SeedPrompts into memory

Ideally this would have been a first PR. But it is a change I didn't know we needed until almost done with the first part (I originally wrapped with asyncio, but that was breaking notebooks and other eventloops).

This is likely something we want anyway. Many PyRIT functions are async, and this gives us flexibility to use them.
I would have rather done this later, but there was no non-kludgy way to make required_datasets without async
There are a few layers that also needed to be updated to async to support this. Most logic was in the front end code, non-breaking for a release
The biggest breaking change is initialze_pyrit needs to be async to support this. Which changes a lot of docs. (again I like this pre-release)

Second problem: Many notebooks failing with target updates

And when running integration tests, I fixed times when the targets weren't working due to model not being defined.

PR Note: Again, this would be nice as a separate PR, but I was running this to fix my own changes (which essentially breaks every notebook due to initialize_pyrit updates) and stumbled upon this

PR Strategy

For the record, I'd say 80% of the code is just re-executing the jupyter notebooks because nearly every one had to be updated to use asynchronous initialize_pyrit_async. And because all the output is new, that's a lot of files and LOC.

This could be broken into three PRs, but it would go into reverse order and is a bit tough to pull apart. I'd prefer to keep as is because it's a lot less work. I recognize this adds difficult to the reviewer, but if it's too difficult to review, please raise this and I can break apart. But I think a lot of that breaking up would be manual

Tests and Docs

commit: 7a86bf4

All integration tests pass.
Except 11_harm_categories, but that was previously broken. I am tempted to remove this file until we can fix it, but the fix is non-trivial. Right now it only works if you're sending SeedPrompts (not SeedObjectives) with a harm category. As such, it won't work with most of our prompts or any multi_turn scenarios, so the behavior is fairly unpredictable. Maybe another PR to remove this before release.
I ran all scenarios. I want to create integration tests here but will wait for another PR.

pyrit/setup/initializers/pyrit_initializer.py

pyrit/setup/initialization.py

romanlutz · 2025-12-07T14:00:04Z

I don't think we have integration tests covering this yet. We probably should...

doc/code/scenarios/1_composite_scenario.py

tests/unit/memory/memory_interface/memory_interface.py

pyrit/cli/pyrit_shell.py

pyrit/setup/initializers/scenarios/load_default_datasets.py

pyrit/cli/scenario_registry.py

doc/code/auxiliary_attacks/0_auxiliary_attacks.py

doc/code/setup/pyrit_initializer.py

doc/code/targets/4_non_llm_targets.py

doc/code/targets/playwright_target.py

doc/code/targets/playwright_target_copilot.py

pyrit/cli/pyrit_shell.py

rlundeen2 added 14 commits December 4, 2025 09:40

docs update

f43cb34

pre-commit

37a8e3c

pre-commit

94e70c2

pre-commit and tests

0af98ab

merging main

586d5ae

updating a few

8e11d5d

merging main

0ab85a3

scenario loader

dc28e01

updating scenario calls to include datasets

5a06cd4

updating initialize_async

ee1e505

plumbing async through and updating tests

a27c1f4

pre-commit

fdde70f

fixing docs

9c0dec2

regenerated notebooks and pre-commit

7a86bf4

rlundeen2 changed the title ~~Draft FEAT: Dataset Initializer and Scenario Datasets~~ FEAT Breaking: Dataset Initializer and Scenario Datasets Dec 6, 2025

rlundeen2 marked this pull request as ready for review December 6, 2025 22:47

rlundeen2 added 4 commits December 6, 2025 14:52

pre-commit

1ee0d16

Merge branch 'main' into users/rlundeen/12_5_dataset_initializer

8cd8cea

pre commit

bf98d0c

test fixes

1fdc5b1