FEAT: Cyber scenario #1180

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Merged

ValbuenaVC merged 41 commits into Azure:main from ValbuenaVC:cyber_scenario

Nov 20, 2025

Contributor

ValbuenaVC commented Nov 10, 2025 •

edited

Loading

Description

Adds a cybersecurity harms scenario to pyrit called the CyberScenario, which tests a model's willingness to generate malware via single-turn or multi-turn (red teaming) attack methods. Changes listed below:

Added CyberScenario and CyberStrategy classes
Added generic malware-oriented prompts to induce cyber harms as seed prompts
Added true/false scoring YAML for malware-oriented prompts
Added composite scoring logic in CyberScenario
Fixed minor typo in grounded.yaml
Added unit tests for CyberScenario

This PR is meant to be a starting point for additional cybersecurity harm scaffolding as there are still many places CyberScenario can be expanded on.

Tests and Documentation

Unit tests focus on initialization, attack generation, execution, and scenario properties, similarly to other scenarios.

Victor Valbuena and others added 22 commits

November 4, 2025 14:59


          Adding cyber scenario.

f008d70


          Cyber scenario skeleton

f1c1f0b


          Adding contents to cyber scenario

faf1fb6


          Finishing cyber scenario

6069ab6


          Testing skeleton

2dbc4b8


          Adding scoring mechanism, t/f criteria, fixed typo in grounded.yaml

ba1d3c7


          Adding more testing skeleton

9dc837f


          Merge branch 'Azure:main' into cyber_scenario

c13c0d6


          Touching up

3c16fc7


          Merge branch 'Azure:main' into cyber_scenario

b45e291


          Attack factory logic

2b2771d


          Merge branch 'cyber_scenario' of https://github.com/ValbuenaVC/PyRIT …

b3289e7

…into cyber_scenario

Resolving merge conflict


          Wrapped up CyberScenario pre-testing, moving on to testing suite

571fb92


          Adding documentation notebook, continuing unit tests

ad8b310


          Finished initialization unit tests

d85dbd3


          Added basic demo notebook

54f142f


          Precommit hooks

20b057a


          Attack generation and properties unit tests


          Removing notebooks temporarily

3c8114a


          Merge branch 'main' into cyber_scenario

1f2fedf


          Fixing broken toctree

04c61ff


          finishing unit tests

6863f57

ValbuenaVC marked this pull request as ready for review

November 12, 2025 00:23

ValbuenaVC changed the title ~~[DRAFT] FEAT: Cyber scenario~~ FEAT: Cyber scenario

ValbuenaVC commented

View reviewed changes

tests/unit/scenarios/test_cyber_scenario.py Outdated Show resolved Hide resolved


          Update tests/unit/scenarios/test_cyber_scenario.py

44b8176

Contributor

hannahwestra25 commented Nov 12, 2025 •

edited

Loading

This looks good! i'm wondering if there are ways to incorporate like xpia attacks or converters (MaliciousQuestionGeneratorConverter, there might be more just at first glance) to be a bit more creative rather than just updating prompts

Contributor Author

ValbuenaVC commented Nov 12, 2025

This looks good! i'm wondering if there are ways to incorporate like xpia attacks or converters (MaliciousQuestionGeneratorConverter, there might be more just at first glance) to be a bit more creative rather than just updating prompts

There definitely are! CyberStrategy is used very sparsely here, which I don't like, but I haven't found a way to reconcile the nature of cybersecurity harms (which are often sequential, iterative, and don't rely on conversions as much) with the tag-based system. But it's definitely something I want to drive in a second PR

hannahwestra25 reviewed

View reviewed changes

pyrit/scenarios/scenarios/airt/cyber_scenario.py Show resolved Hide resolved

hannahwestra25 reviewed

View reviewed changes

pyrit/scenarios/scenarios/cyber_scenario.py Outdated Show resolved Hide resolved

rlundeen2 reviewed

View reviewed changes

pyrit/scenarios/scenarios/airt/cyber_scenario.py Show resolved Hide resolved

rlundeen2 reviewed

View reviewed changes

pyrit/scenarios/scenarios/airt/cyber_scenario.py Show resolved Hide resolved

rlundeen2 reviewed

View reviewed changes

pyrit/scenarios/scenarios/airt/cyber_scenario.py Show resolved Hide resolved

rlundeen2 reviewed

View reviewed changes

pyrit/scenarios/scenarios/airt/cyber_scenario.py Outdated Show resolved Hide resolved

= and others added 4 commits

November 14, 2025 21:28


          Removed hardcoded unit test and added fast/slow dichotomy

051891d


          Merge branch 'main' into cyber_scenario

cf44c12


          interface refactoring

9cc1042


          Moving scenario under airt

2ac3372

ValbuenaVC commented

View reviewed changes

pyrit/scenarios/scenarios/airt/cyber_scenario.py Show resolved Hide resolved

pyrit/scenarios/scenarios/airt/cyber_scenario.py Show resolved Hide resolved

pyrit/scenarios/scenarios/airt/cyber_scenario.py Show resolved Hide resolved

pyrit/scenarios/__init__.py Outdated Show resolved Hide resolved

pyrit/scenarios/scenarios/airt/cyber_scenario.py Outdated Show resolved Hide resolved

= and others added 5 commits

November 15, 2025 00:20


          Precommit fixes

0c3120a


          Adding composite scorer

9a1cce8


          Notebooks

ac983d9


          Merge branch 'main' into cyber_scenario

2022b6d


          Precommit

825748c

rlundeen2 approved these changes

View reviewed changes

Contributor

rlundeen2 left a comment

Looks great! I recommend incorporating the changes first but they are small

pyrit/scenarios/scenarios/airt/cyber_scenario.py Outdated Show resolved Hide resolved

pyrit/scenarios/scenarios/airt/cyber_scenario.py Outdated Show resolved Hide resolved

pyrit/scenarios/scenarios/airt/cyber_scenario.py Show resolved Hide resolved

rlundeen2 reviewed

View reviewed changes

doc/code/scenarios/cyberscenarios.ipynb Outdated Show resolved Hide resolved

rlundeen2 reviewed

View reviewed changes

pyrit/scenarios/__init__.py Show resolved Hide resolved

ValbuenaVC and others added 4 commits

November 18, 2025 11:26


          Merge branch 'main' into cyber_scenario

a1100be


          Removing notebooks

068c9f4


          using single strategy extraction method

34f1cd4


          Merge branch 'main' into cyber_scenario

562a4d7

hannahwestra25 reviewed

View reviewed changes

pyrit/scenarios/scenarios/airt/cyber_scenario.py Outdated Show resolved Hide resolved

hannahwestra25 reviewed

View reviewed changes

pyrit/scenarios/scenarios/airt/cyber_scenario.py Outdated Show resolved Hide resolved

hannahwestra25 reviewed

View reviewed changes

pyrit/datasets/seed_prompts/malware.prompt Show resolved Hide resolved

hannahwestra25 approved these changes

View reviewed changes

ValbuenaVC and others added 2 commits

November 19, 2025 17:09


          Update pyrit/scenarios/scenarios/airt/cyber_scenario.py

5c50775

Typo

Co-authored-by: hannahwestra25 <hannahwestra@microsoft.com>


          Redundant docstring

c881440

ValbuenaVC merged commit f02d3f2 into Azure:main

37 of 38 checks passed

ValbuenaVC deleted the cyber_scenario branch

December 4, 2025 23:14

ValbuenaVC restored the cyber_scenario branch

December 4, 2025 23:14

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet