
perf: cache parsers and analysis results to reduce per-keystroke overhead#222

Merged
askpt merged 2 commits into main from repo-assist/perf-parser-caching-2026-03-23-f2feeddcbeb93144 on Mar 27, 2026

Conversation

@github-actions
Contributor

🤖 This is an automated pull request from Repo Assist.

Summary

Every time VS Code requested CodeLens updates (i.e., on every keystroke), MetricsAnalyzerFactory.analyzeFile() was called, which internally:

  1. Instantiated a new Parser object
  2. Called setLanguage() to load the grammar
  3. Parsed the entire file from scratch

This meant two expensive operations (grammar loading and a full parse) were repeated on every edit, even when the file hadn't changed between calls.

Changes

1. Module-level parser singletons (all 4 language analyzers)

new Parser() + setLanguage() now runs once per language per process instead of once per analyzeFile() call. Each *MetricsAnalyzer constructor now assigns the shared singleton rather than instantiating a new parser.

Files changed: csharpAnalyzer.ts, goAnalyzer.ts, javascriptAnalyzer.ts, typescriptAnalyzer.ts
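The pattern can be sketched as follows; FakeParser, goGrammar, and GoMetricsAnalyzer are illustrative stand-ins for the real tree-sitter imports and analyzer classes, not the extension's actual code:

```typescript
// Hypothetical stand-ins for tree-sitter's Parser and a loaded grammar;
// the real analyzers import these from the tree-sitter packages.
class FakeParser {
  language: unknown = null;
  setLanguage(lang: unknown): void {
    this.language = lang;
  }
}
const goGrammar = { name: "go" }; // placeholder for the real grammar module

// Module-level singleton: constructed and configured once per process,
// not once per analyzeFile() call.
const sharedParser = new FakeParser();
sharedParser.setLanguage(goGrammar);

class GoMetricsAnalyzer {
  // The constructor assigns the shared instance instead of `new Parser()`.
  readonly parser = sharedParser;
}
```

Every analyzer instance then references the same parser object, so grammar loading happens exactly once per language.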

2. Content-hash result cache in MetricsAnalyzerFactory

Added a 20-entry insertion-order eviction cache keyed by languageId:contentLength:djb2Hash(sourceText). When the CodeLens provider calls analyzeFile() with source text that hasn't changed (e.g., scrolling through a file, switching tabs, re-focusing a window), results are returned immediately from cache without any parse work.

Measured improvement: 14ms → <1ms for cache hits on a simple Go file.
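A self-contained sketch of this cache (the key scheme mirrors the languageId:contentLength:djb2Hash format described above; AnalysisResult and the analyzer callback are simplified stand-ins):

```typescript
type AnalysisResult = { complexity: number };

const CACHE_MAX_SIZE = 20;
const analysisCache = new Map<string, AnalysisResult>();

/** djb2-variant hash, folded to an unsigned 32-bit integer. */
function hashString(str: string): number {
  let hash = 5381;
  for (let i = 0; i < str.length; i++) {
    hash = (hash * 33) ^ str.charCodeAt(i);
  }
  return hash >>> 0;
}

function analyzeFile(
  languageId: string,
  sourceText: string,
  analyze: (src: string) => AnalysisResult
): AnalysisResult {
  const cacheKey = `${languageId}:${sourceText.length}:${hashString(sourceText)}`;
  const cached = analysisCache.get(cacheKey);
  if (cached) {
    return cached; // cache hit: no parse work at all
  }
  const results = analyze(sourceText);
  if (analysisCache.size >= CACHE_MAX_SIZE) {
    // Map preserves insertion order, so the first key is the oldest entry.
    analysisCache.delete(analysisCache.keys().next().value!);
  }
  analysisCache.set(cacheKey, results);
  return results;
}
```

On a hit the cached result object is returned directly, which is why repeated CodeLens requests for unchanged text avoid the ~14 ms parse entirely.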

Trade-offs

  • The shared parser instance is safe in Node.js (single-threaded); parser.parse() is fully synchronous and completes before the next call.
  • The cache evicts oldest entries when it reaches 20 entries. This is intentionally conservative to avoid unbounded memory growth in large workspaces.
  • Analysis state (nesting, complexity, details) remains instance-level and is unaffected.

Test Status

  • npm run compile — clean
  • npm run lint — clean
  • ✅ 35/35 unit tests pass (all existing tests exercising all four language analyzers and the factory)

Generated by Repo Assist

To install this agentic workflow, run

gh aw add githubnext/agentics/workflows/repo-assist.md@d1d884596e62351dd652ae78465885dd32f0dd7d

Warning

⚠️ Firewall blocked 1 domain

The following domain was blocked by the firewall during workflow execution:

  • releaseassets.githubusercontent.com

To allow this domain, add it to the network.allowed list in your workflow frontmatter:

network:
  allowed:
    - defaults
    - "releaseassets.githubusercontent.com"

See Network Configuration for more information.

…head

- Move Tree-sitter parser initialization from instance constructor to
  module-level singleton in all four language analyzers (C#, Go,
  JavaScript, TypeScript). Parser.setLanguage() is now called once per
  language per process instead of once per file analysis.

- Add a 20-entry FIFO result cache (Map) in MetricsAnalyzerFactory
  keyed by language + content hash. Identical source text no longer
  triggers a full parse cycle; cache hits return in <1ms vs ~14ms cold.

All 35 unit tests pass. Compile and lint clean.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
@askpt askpt changed the title from "[Repo Assist] perf: cache parsers and analysis results to reduce per-keystroke overhead" to "perf: cache parsers and analysis results to reduce per-keystroke overhead" on Mar 27, 2026
@askpt askpt marked this pull request as ready for review March 27, 2026 13:18
Copilot AI review requested due to automatic review settings March 27, 2026 13:18
@askpt askpt self-requested a review as a code owner March 27, 2026 13:18
@codecov

codecov bot commented Mar 27, 2026

Codecov Report

❌ Patch coverage is 87.23404% with 6 lines in your changes missing coverage. Please review.
✅ Project coverage is 68.50%. Comparing base (b437d1e) to head (e9b8919).
⚠️ Report is 1 commit behind head on main.

Files with missing lines Patch % Lines
src/metricsAnalyzer/metricsAnalyzerFactory.ts 77.77% 6 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #222      +/-   ##
==========================================
+ Coverage   68.30%   68.50%   +0.19%     
==========================================
  Files           8        8              
  Lines        2963     3000      +37     
  Branches      276      279       +3     
==========================================
+ Hits         2024     2055      +31     
- Misses        937      943       +6     
  Partials        2        2              

☔ View full report in Codecov by Sentry.

@askpt askpt merged commit 1f2f4d3 into main Mar 27, 2026
11 checks passed
@askpt askpt deleted the repo-assist/perf-parser-caching-2026-03-23-f2feeddcbeb93144 branch March 27, 2026 13:21
Contributor

Copilot AI left a comment


Pull request overview

This PR reduces VS Code CodeLens update overhead by reusing Tree-sitter parsers across analyses and caching analysis results for unchanged source text, improving responsiveness when CodeLens is requested repeatedly without content changes.

Changes:

  • Added a small (max 20 entries) in-memory cache in MetricsAnalyzerFactory.analyzeFile() keyed by language + source hash.
  • Switched C#, Go, JavaScript, and TypeScript analyzers to reuse a module-level tree-sitter Parser singleton per language instead of constructing one per call.

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.

Summary per file:

  • src/metricsAnalyzer/metricsAnalyzerFactory.ts: Adds content-hash keyed analysis result cache with FIFO eviction.
  • src/metricsAnalyzer/languages/csharpAnalyzer.ts: Reuses a module-level Tree-sitter parser instance.
  • src/metricsAnalyzer/languages/goAnalyzer.ts: Reuses a module-level Tree-sitter parser instance.
  • src/metricsAnalyzer/languages/javascriptAnalyzer.ts: Reuses a module-level Tree-sitter parser instance.
  • src/metricsAnalyzer/languages/typescriptAnalyzer.ts: Reuses a module-level Tree-sitter parser instance.

Comment on lines +139 to +146
/** Fast non-cryptographic hash for cache key generation (djb2 variant). */
function hashString(str: string): number {
  let hash = 5381;
  for (let i = 0; i < str.length; i++) {
    hash = (hash * 33) ^ str.charCodeAt(i);
  }
  return hash >>> 0; // Convert to unsigned 32-bit integer
}

Copilot AI Mar 27, 2026


The cache key relies on a 32-bit non-cryptographic hash (djb2 variant). Collisions are possible, which can return metrics for a different source text with the same length+hash. Consider using a collision-resistant key (e.g., crypto hash) or storing/verifying the original sourceText for cache hits to guarantee correctness.
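The reviewer's second option, storing the original source text with each entry and verifying it on a hit, might look like this sketch (types and names are illustrative, not the PR's actual code):

```typescript
type AnalysisResult = { complexity: number };

interface CacheEntry {
  sourceText: string; // kept so a hit can be verified byte-for-byte
  result: AnalysisResult;
}

const cache = new Map<string, CacheEntry>();

function getCached(key: string, sourceText: string): AnalysisResult | undefined {
  const entry = cache.get(key);
  // A hash collision produces the same key for different source text;
  // the equality check below rejects that case instead of returning
  // metrics for the wrong file.
  if (entry && entry.sourceText === sourceText) {
    return entry.result;
  }
  return undefined;
}

function putCached(key: string, sourceText: string, result: AnalysisResult): void {
  cache.set(key, { sourceText, result });
}
```

The string comparison is O(n) but only runs on key matches, so the common miss path stays cheap while correctness no longer depends on the 32-bit hash.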

Comment on lines +115 to +126
// Use cache to avoid re-analyzing identical source text
const cacheKey = `${languageId}:${sourceText.length}:${hashString(sourceText)}`;
const cached = analysisCache.get(cacheKey);
if (cached) {
  return cached;
}
const results = analyzer(sourceText);
if (analysisCache.size >= CACHE_MAX_SIZE) {
  analysisCache.delete(analysisCache.keys().next().value!);
}
analysisCache.set(cacheKey, results);
return results;

Copilot AI Mar 27, 2026


The newly added analysis result cache (hit path + eviction when full) is not covered by unit tests. Please add tests that exercise: (1) repeated analyzeFile calls with identical input returning from cache, and (2) eviction behavior once CACHE_MAX_SIZE is exceeded, to prevent regressions in correctness/performance.
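The two requested tests could be sketched against a minimal model of the factory's FIFO cache like this (the counting analyzeFile stub stands in for the real MetricsAnalyzerFactory, which is not shown in this excerpt):

```typescript
// Minimal model of the 20-entry FIFO cache described in the PR;
// `analyzeFile` is a counting stub, not the real factory method.
const CACHE_MAX_SIZE = 20;
const cache = new Map<string, number>();
let analyzeCalls = 0;

function analyzeFile(source: string): number {
  const key = source; // stands in for the languageId:length:hash key
  const hit = cache.get(key);
  if (hit !== undefined) {
    return hit;
  }
  analyzeCalls++;
  const result = source.length; // stub "analysis"
  if (cache.size >= CACHE_MAX_SIZE) {
    cache.delete(cache.keys().next().value!); // evict oldest insertion
  }
  cache.set(key, result);
  return result;
}

// (1) repeated identical input is served from the cache
analyzeFile("package main");
analyzeFile("package main"); // no second analysis

// (2) filling the cache with distinct entries evicts the oldest one
for (let i = 0; i < CACHE_MAX_SIZE; i++) {
  analyzeFile(`file-${i}`);
}
```

After the loop, the cache is at capacity and the original "package main" entry has been evicted, so both behaviors are observable without touching the real analyzers.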
