Commit 0dedfcc

docs: update evaluation pipeline instructions to reference Cloud Build and maintainer review process
1 parent 8296b10 commit 0dedfcc

1 file changed

Lines changed: 1 addition & 1 deletion

DEVELOPER.md
@@ -71,7 +71,7 @@ When adding new skills or modifying existing behavior, you should add or update
 1. Open `evals/gemini_dataset.json` (and/or `evals/claude_dataset.json`).
 2. Add a new scenario block with a unique `id`, a clear `starting_prompt`, a detailed `conversation_plan`, and the `expected_trajectory` of tool calls.
 3. Apply the `ci:run-evals` label while creating your pull request to trigger the evaluation pipeline.
-4. Evaluation metrics and outcomes are uploaded to BigQuery and can be monitored on the team's centralized evaluation dashboards.
+4. The evaluation pipeline runs securely via Cloud Build. A maintainer will review the internal logs and results to verify your scenarios pass successfully.

 ### Other GitHub Checks

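Step 2 of the changed section names four fields for a scenario block: `id`, `starting_prompt`, `conversation_plan`, and `expected_trajectory`. The sketch below shows what such a block might look like and how it could be checked locally before opening a pull request. Only the four field names come from DEVELOPER.md. The value types, the example values, the tool name, and the `validate_scenarios` helper are hypothetical, and the real dataset files may be structured differently.

```python
import json

# Field names come from step 2 of DEVELOPER.md; everything else here is assumed.
REQUIRED_FIELDS = ("id", "starting_prompt", "conversation_plan", "expected_trajectory")


def validate_scenarios(scenarios):
    """Return a list of human-readable problems; an empty list means none were found."""
    problems = []
    seen_ids = set()
    for index, scenario in enumerate(scenarios):
        # Report every required field that is absent from this scenario.
        missing = [field for field in REQUIRED_FIELDS if field not in scenario]
        if missing:
            problems.append(f"scenario {index}: missing {', '.join(missing)}")
        # Step 2 asks for a unique id, so flag any repeats.
        scenario_id = scenario.get("id")
        if scenario_id in seen_ids:
            problems.append(f"scenario {index}: duplicate id {scenario_id!r}")
        elif scenario_id is not None:
            seen_ids.add(scenario_id)
    return problems


# Hypothetical scenario block; the values and the tool name are invented.
example_scenario = {
    "id": "example-scenario-001",
    "starting_prompt": "Summarize the open issues in this repository.",
    "conversation_plan": "Ask for a summary, then ask a follow-up about one issue.",
    "expected_trajectory": ["hypothetical_list_issues_tool"],
}

if __name__ == "__main__":
    print(json.dumps(example_scenario, indent=2))
    print(validate_scenarios([example_scenario]))
```

A check like this only catches structural slips such as a missing field or a reused `id`. Whether a scenario actually passes is still decided by the evaluation pipeline and the maintainer review described in step 4.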