feat: upstream sync pipeline with per-commit AI analysis by Copilot · Pull Request #3 · sharpninja/graphrag

Copilot · 2026-02-28T17:22:34Z

Description

Adds an automated pipeline that mirrors microsoft/graphrag main into an incoming branch daily, then dispatches an AI agent per new commit to analyze required .NET and documentation changes, opens a labeled PR, and enables auto-merge.

Related Issues

Upstream synchronization requirement.

Proposed Changes

.github/workflows/sync-incoming.yml
- Runs daily at 06:00 UTC (+ workflow_dispatch)
- Force-resets incoming branch to upstream/main tip (pure mirror, intentional --force)
- Computes PREV..upstream/main commit range (capped at 10); on first run, queues only the latest commit
- Fires analyze-upstream-commit.yml per SHA via matrix dispatch (max 2 parallel)
.github/workflows/analyze-upstream-commit.yml
- Input: upstream_commit_sha (called by sync workflow or manually)
- Extracts commit message, stat, and diff (Python/Markdown only, ≤ 8 KB)
- Calls GitHub Models API (gpt-4o-mini via GITHUB_TOKEN) — structured output: Summary, .NET Changes Required, Documentation Changes Required, Priority, PR Title, PR Body
- Commits analysis doc to docs/upstream-sync/upstream-<sha8>.md on a new sync/upstream-<sha8> branch
- Opens PR → main labelled upstream-sync with full analysis in body
- Enables auto-merge via GraphQL enablePullRequestAutoMerge (squash); falls back to direct merge; leaves PR open on conflict
- Fully idempotent — re-running for the same SHA is a no-op

Checklist

I have tested these changes locally.
I have reviewed the code changes.
I have updated the documentation (if necessary).
I have added appropriate unit tests (if applicable).

Additional Notes

Auto-merge requires branch-protection rules with "Allow auto-merge" enabled on the repo. Without it, the workflow falls back to a direct squash-merge attempt; PRs with conflicts remain open for manual resolution.

💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more Copilot coding agent tips in the docs.

Adds two GitHub Actions workflows: sync-incoming.yml: - Runs daily at 06:00 UTC (and on demand) - Fetches microsoft/graphrag main and force-pushes to the incoming branch - Calculates new upstream commits since the last sync (max 10) - Dispatches analyze-upstream-commit.yml for each new commit analyze-upstream-commit.yml: - Triggered per-commit via workflow_dispatch - Fetches the specific upstream commit and extracts message, stat, and diff - Calls GitHub Models API (gpt-4o-mini) for structured dotnet/docs analysis - Creates a sync/upstream-<sha8> branch with an analysis document - Opens a PR targeting main labelled upstream-sync - Enables auto-merge (GraphQL); falls back to direct squash-merge; leaves PR open if there are unresolvable conflicts - Fully idempotent: re-running for same SHA is a no-op Co-authored-by: sharpninja <16146732+sharpninja@users.noreply.github.com>

Copilot

Pull request overview

This PR adds a two-workflow automated pipeline for monitoring and analyzing upstream changes from microsoft/graphrag. A daily sync workflow force-mirrors upstream/main into an incoming branch, computes new commits since the last sync, and dispatches per-commit analysis jobs. The analysis workflow fetches each commit, sends its message/stat/diff to the GitHub Models API (gpt-4o-mini) for AI-generated .NET and docs change recommendations, commits the analysis to a sync/upstream-<sha8> branch, opens a labeled PR to main, and attempts to enable auto-merge.

Changes:

sync-incoming.yml: Daily cron + manual dispatch to mirror upstream main → incoming branch and dispatch per-SHA analysis jobs (up to 10, capped at 2 parallel).
analyze-upstream-commit.yml: Per-SHA workflow that extracts commit info, calls the GitHub Models API for AI analysis, creates a sync branch with the analysis doc, opens a PR, and enables auto-merge.