1 change: 1 addition & 0 deletions .agents/skills/e2e-tests/SKILL.md
@@ -64,6 +64,7 @@ Cassettes mock provider HTTP responses (OpenAI, Anthropic, ...) so external-prov
- When a scenario family already has `assertions.ts`, keep version- or provider-specific test setup in `scenario.test.ts` and reuse the shared assertions file.
- Keep the CI e2e summary up to date. If a scenario version matrix or `variantKey` changes, update `e2e/config/pr-comment-scenarios.json` in the same change and follow the established pattern used by other versioned scenarios: one summary row per version, not separate wrapped/auto rows unless that pattern already exists for the scenario family.
- Run new or updated scenarios three times in a row before considering snapshots stable.
- Do not add tests directly asserting on the auto instrumentation configs.

## Scenario Patterns

5 changes: 5 additions & 0 deletions .changeset/clear-months-fold.md
@@ -0,0 +1,5 @@
---
"braintrust": minor
---

feat: Add `@flue/runtime` instrumentation

We should document here what this instruments, along with the new public API it adds (`wrapFlueSession` and `wrapFlueContext`).

12 changes: 12 additions & 0 deletions e2e/config/pr-comment-scenarios.json
@@ -162,6 +162,18 @@
"metadataScenario": "cursor-sdk-instrumentation",
"variants": [{ "variantKey": "cursor-sdk-v1", "label": "v1" }]
},
{
"scenarioDirName": "flue-instrumentation",
"label": "Flue Instrumentation",
"metadataScenario": "flue-instrumentation",
"variants": [
{ "variantKey": "flue-v0-7-0-wrapped", "label": "v0.7.0 wrapped" },
{
"variantKey": "flue-v0-7-0-auto-hook",
"label": "v0.7.0 auto-hook"
}
]
},
{
"scenarioDirName": "github-copilot-instrumentation",
"label": "GitHub Copilot SDK Instrumentation",
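
Because the summary config keys each row by `variantKey`, a quick sanity check for this kind of change is to confirm that no key is reused across scenarios. The following is a minimal sketch, not code from this PR: the `ScenarioEntry` and `ScenarioVariant` shapes are inferred only from the entries visible in the hunk above, and `findDuplicateVariantKeys` is a hypothetical helper name.

```typescript
// Shapes inferred from the entries visible in this hunk (assumption:
// the real file may carry additional fields).
interface ScenarioVariant {
  variantKey: string;
  label: string;
}

interface ScenarioEntry {
  scenarioDirName: string;
  label: string;
  metadataScenario: string;
  variants: ScenarioVariant[];
}

// Hypothetical helper: returns every variantKey that appears more than
// once across all entries, in order of first repeated occurrence.
function findDuplicateVariantKeys(entries: ScenarioEntry[]): string[] {
  const seen = new Set<string>();
  const duplicates = new Set<string>();
  for (const entry of entries) {
    for (const variant of entry.variants) {
      if (seen.has(variant.variantKey)) {
        duplicates.add(variant.variantKey);
      }
      seen.add(variant.variantKey);
    }
  }
  return Array.from(duplicates);
}

// The entry added by this change, copied from the hunk above.
const flueEntry: ScenarioEntry = {
  scenarioDirName: "flue-instrumentation",
  label: "Flue Instrumentation",
  metadataScenario: "flue-instrumentation",
  variants: [
    { variantKey: "flue-v0-7-0-wrapped", label: "v0.7.0 wrapped" },
    { variantKey: "flue-v0-7-0-auto-hook", label: "v0.7.0 auto-hook" },
  ],
};

const duplicateKeys = findDuplicateVariantKeys([flueEntry]);
console.log(duplicateKeys.length === 0 ? "variant keys are unique" : duplicateKeys);
```

In practice the entries would be read from `e2e/config/pr-comment-scenarios.json` rather than inlined; the inline entry just keeps the sketch self-contained.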
@@ -0,0 +1,6 @@
---
name: e2e-flue-skill
description: Deterministic skill used by the Braintrust Flue instrumentation e2e test.
---

Return the marker from the provided args exactly once. Output no other text.

Large diffs are not rendered by default.