fix: enforce max_workers in LLMMetadataExtractor.run_async by etairl · Pull Request #11248 · deepset-ai/haystack

etairl · 2026-05-04T17:13:31Z

Summary

LLMMetadataExtractor.run_async acquires its asyncio.Semaphore once around the outer gather(...) call instead of inside each task. As a result, max_workers has no effect and every prompt in a batch fires its LLM call simultaneously.
The docstring on __init__ advertises max_workers as "the maximum number of requests that should be allowed to run concurrently when using the run_async method", so the current behavior silently breaks that contract and can exceed LLM-provider rate limits or exhaust connection pools on large batches.
The fix moves the async with sem: block into a per-task wrapper coroutine so the limit is actually enforced. A new regression test verifies that peak in-flight calls stay <= max_workers.

Before

sem = Semaphore(max(1, self.max_workers))
async with sem:
    results = await gather(*[self._run_async(prompt) for prompt in all_prompts])
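To make the problem concrete, here is a minimal standalone sketch of the pattern above. It does not use Haystack: `fake_llm_call` and `peak_with_outer_semaphore` are hypothetical names, and the sleep merely stands in for a network round trip. The semaphore is acquired exactly once, so the coroutines passed to `gather` never wait on it.

```python
import asyncio


async def fake_llm_call(state: dict) -> None:
    # Hypothetical stand-in for one LLM request; records how many run at once.
    state["in_flight"] += 1
    state["peak"] = max(state["peak"], state["in_flight"])
    await asyncio.sleep(0.01)  # simulated network latency
    state["in_flight"] -= 1


async def peak_with_outer_semaphore(n_tasks: int, max_workers: int) -> int:
    state = {"in_flight": 0, "peak": 0}
    sem = asyncio.Semaphore(max(1, max_workers))
    # The semaphore is taken once here; none of the gathered coroutines
    # ever acquire it, so it cannot limit how many of them run together.
    async with sem:
        await asyncio.gather(*[fake_llm_call(state) for _ in range(n_tasks)])
    return state["peak"]


outer_peak = asyncio.run(peak_with_outer_semaphore(n_tasks=10, max_workers=3))
print(outer_peak)  # all 10 calls are in flight together despite max_workers=3
```

With 10 tasks and `max_workers=3`, the recorded peak equals the number of tasks rather than the configured limit.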

After

sem = Semaphore(max(1, self.max_workers))

async def _bounded_run(prompt: ChatMessage | None) -> dict[str, Any]:
    async with sem:
        return await self._run_async(prompt)

results = await gather(*[_bounded_run(prompt) for prompt in all_prompts])

Test plan

hatch run test:unit -k test_llm_metadata_extractor passes (includes new test_run_async_respects_max_workers).
CI green.
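For readers who want to see what a peak-in-flight check can look like, the following is a rough sketch and not the test added in this PR. `FakeExtractor` is a hypothetical stand-in that copies only the patched concurrency pattern; the real test presumably exercises LLMMetadataExtractor with a mocked chat generator.

```python
import asyncio
from typing import Any


class FakeExtractor:
    """Hypothetical stand-in; mirrors only the per-task semaphore pattern."""

    def __init__(self, max_workers: int) -> None:
        self.max_workers = max_workers
        self.in_flight = 0
        self.peak = 0

    async def _run_async(self, prompt: str) -> dict[str, Any]:
        # Track concurrent calls while "waiting" on the simulated LLM.
        self.in_flight += 1
        self.peak = max(self.peak, self.in_flight)
        await asyncio.sleep(0.01)
        self.in_flight -= 1
        return {"prompt": prompt}

    async def run_async(self, prompts: list[str]) -> list[dict[str, Any]]:
        sem = asyncio.Semaphore(max(1, self.max_workers))

        async def _bounded_run(prompt: str) -> dict[str, Any]:
            # Each task acquires the semaphore itself, so the cap applies.
            async with sem:
                return await self._run_async(prompt)

        return await asyncio.gather(*[_bounded_run(p) for p in prompts])


def test_peak_in_flight_stays_within_max_workers() -> None:
    extractor = FakeExtractor(max_workers=2)
    results = asyncio.run(extractor.run_async([f"p{i}" for i in range(8)]))
    assert len(results) == 8
    assert extractor.peak <= extractor.max_workers
```

Asserting on the observed peak rather than on timing keeps a check like this independent of how fast the machine running it is.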

The asyncio.Semaphore intended to bound concurrent LLM calls was acquired once around the outer gather(...) call instead of inside each task, so max_workers had no effect in run_async and all batched LLM requests fired simultaneously. Move the semaphore acquisition into a per-task wrapper so the documented concurrency cap is honored.

vercel · 2026-05-04T17:13:38Z

@etairl is attempting to deploy a commit to the deepset Team on Vercel.

A member of the Team first needs to authorize it.

etairl requested a review from a team as a code owner May 4, 2026 17:13

etairl requested review from bogdankostic and removed request for a team May 4, 2026 17:13

github-actions Bot added the topic:tests and type:documentation (Improvements on the docs) labels May 4, 2026

etairl wants to merge 1 commit into deepset-ai:main from etairl:fix/llm-metadata-extractor-async-semaphore
