initial by dhruvladia-sarvam · Pull Request #4923 · livekit/agents

dhruvladia-sarvam · 2026-02-23T10:46:20Z

This PR updates the Sarvam TTS plugin to support newer Bulbul model controls and model-specific parameter behavior in tts.py, with corresponding validation.

Key updates for `bulbul:v3` / `bulbul:v3-beta`

- Add temperature with range validation and a default.
- Add output and chunking controls:
  - output_audio_bitrate
  - min_buffer_size
  - max_chunk_length
- Send temperature only for bulbul:v3/bulbul:v3-beta.
- Do not send enable_preprocessing for bulbul:v3/bulbul:v3-beta (it is sent only for bulbul:v2).
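
The model-specific sending rules above can be sketched roughly as follows. This is an illustration only, not the code in tts.py: the helper name `build_payload` and the payload dict shape are assumptions, and only the two fields whose per-model behavior is stated here are modeled.

```python
from typing import Any, Dict

# Model names taken from the PR description.
V3_MODELS = {"bulbul:v3", "bulbul:v3-beta"}


def build_payload(
    model: str, *, temperature: float, enable_preprocessing: bool
) -> Dict[str, Any]:
    """Hypothetical helper: include only the fields the given model should receive."""
    payload: Dict[str, Any] = {"model": model}
    if model in V3_MODELS:
        # temperature is sent only for bulbul:v3 / bulbul:v3-beta
        payload["temperature"] = temperature
    elif model == "bulbul:v2":
        # enable_preprocessing is sent only for bulbul:v2
        payload["enable_preprocessing"] = enable_preprocessing
    return payload
```

With this shape, a v3 request never carries enable_preprocessing and a v2 request never carries temperature, regardless of what the caller configured.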

Validation and runtime options

Constructor and update_options() now validate all newly added fields.
Errors are raised for invalid enum/range inputs before request dispatch.
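
One way to get the same checks in both entry points is to route the constructor and update_options() through a shared validator. The sketch below assumes that approach and models only temperature, using the 0.01 to 1.0 range given in this description; the class name `TTSOptionsSketch` and its private helper are hypothetical and are not the plugin's actual structure.

```python
from typing import Optional


class TTSOptionsSketch:
    """Hypothetical stand-in for the plugin's TTS class; only temperature is modeled."""

    def __init__(self, *, temperature: float = 0.6) -> None:
        self._temperature = self._check_temperature(temperature)

    @staticmethod
    def _check_temperature(value: float) -> float:
        # Range taken from the PR description (0.01..1.0).
        if not 0.01 <= value <= 1.0:
            raise ValueError("temperature must be between 0.01 and 1.0")
        return value

    def update_options(self, *, temperature: Optional[float] = None) -> None:
        # Validate before assigning, so a rejected value never replaces the stored one.
        if temperature is not None:
            self._temperature = self._check_temperature(temperature)

    @property
    def temperature(self) -> float:
        return self._temperature
```

Because the check runs before assignment, an invalid update raises and leaves the previous setting in place, so no request is built from a bad value.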

Why

Sarvam Bulbul v3 APIs expose additional output and buffering controls that were not previously configurable in the plugin. This PR makes those controls first-class in plugin options while preserving model-specific compatibility behavior.

Detailed changes

New options with defaults
- temperature: float = 0.6 (valid 0.01..1.0)
- output_audio_bitrate: "128k" (allowed: 32k|64k|96k|128k|192k)
- min_buffer_size: 50 (valid 30..200)
- max_chunk_length: 150 (valid 50..500)
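
The defaults and ranges listed above translate into checks along these lines. This is a minimal sketch under assumptions: the function name `validate_new_options` is hypothetical, the exact error wording for the range checks is invented for illustration, and `ALLOWED_OUTPUT_AUDIO_BITRATES` is assumed to be a set of strings.

```python
# Allowed values taken from the PR description; the container type is an assumption.
ALLOWED_OUTPUT_AUDIO_BITRATES = {"32k", "64k", "96k", "128k", "192k"}


def validate_new_options(
    *,
    temperature: float = 0.6,
    output_audio_bitrate: str = "128k",
    min_buffer_size: int = 50,
    max_chunk_length: int = 150,
) -> None:
    """Hypothetical validator: raise ValueError on the first invalid option."""
    if not 0.01 <= temperature <= 1.0:
        raise ValueError("temperature must be between 0.01 and 1.0")
    if output_audio_bitrate not in ALLOWED_OUTPUT_AUDIO_BITRATES:
        allowed = ", ".join(sorted(ALLOWED_OUTPUT_AUDIO_BITRATES))
        raise ValueError(f"output_audio_bitrate must be one of {allowed}")
    if not 30 <= min_buffer_size <= 200:
        raise ValueError("min_buffer_size must be between 30 and 200")
    if not 50 <= max_chunk_length <= 500:
        raise ValueError("max_chunk_length must be between 50 and 500")
```

Note that `sorted()` orders these strings lexicographically, so the message lists them as 128k, 192k, 32k, 64k, 96k rather than in numeric order.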

dhruvladia-sarvam · 2026-02-23T14:50:43Z

@darryncampbell, it would be great to have this PR reviewed and merged as soon as possible, as it also contains a small but important model-access change.

Copilot

Pull request overview

This PR updates the Sarvam TTS plugin to support the newer Bulbul v3 model with its additional controls and model-specific parameter behavior. The changes add temperature control, output audio settings, and buffering/chunking controls while properly segregating which parameters are sent to which model versions.

Changes:

- Added support for the bulbul:v3 model alongside the existing bulbul:v2 and bulbul:v3-beta
- Added four new configurable parameters: temperature (0.01-1.0), output_audio_bitrate (32k/64k/96k/128k/192k), min_buffer_size (30-200), and max_chunk_length (50-500)
- Implemented model-specific parameter handling: temperature sent only for v3/v3-beta, enable_preprocessing sent only for v2

Copilot · 2026-02-26T15:56:18Z

livekit-plugins/livekit-plugins-sarvam/livekit/plugins/sarvam/tts.py

        pace: Speech rate multiplier (0.5 to 2.0)
        loudness: Volume multiplier (0.5 to 2.0) - only supported in v2 for now
+        temperature: Sampling temperature (0.01 to 1.0), only used in v3 and v3-beta
+        output_audio_bitrate: Output audio bitrate (default 128k)


The documentation for output_audio_bitrate should include the allowed values. Consider updating to: "output_audio_bitrate: Output audio bitrate (32k, 64k, 96k, 128k, or 192k; default 128k)"

Suggested change

output_audio_bitrate: Output audio bitrate (default 128k)

output_audio_bitrate: Output audio bitrate (32k, 64k, 96k, 128k, or 192k; default 128k)

Copilot · 2026-02-26T15:56:18Z

livekit-plugins/livekit-plugins-sarvam/livekit/plugins/sarvam/tts.py

+                    "output_audio_bitrate must be one of "
+                    f"{', '.join(sorted(ALLOWED_OUTPUT_AUDIO_BITRATES))}"


The error message format is inconsistent between __init__ and update_options. In __init__ (lines 387-388), the message uses 'one of' with a comma-separated list. In update_options (lines 547-548), it uses 'one of' followed by the list on the next line without repeating 'one of'. For consistency, both should use the same format.

Suggested change

"output_audio_bitrate must be one of "

f"{', '.join(sorted(ALLOWED_OUTPUT_AUDIO_BITRATES))}"

f"output_audio_bitrate must be one of {', '.join(sorted(ALLOWED_OUTPUT_AUDIO_BITRATES))}"
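
For reference, the two layouts in this suggestion evaluate to the same string, because Python joins adjacent string literals (f-strings included) into one. The check below is a small sketch; the contents of `ALLOWED_OUTPUT_AUDIO_BITRATES` are assumed from the allowed values listed in the PR description.

```python
# Assumed contents, based on the allowed values listed in the PR description.
ALLOWED_OUTPUT_AUDIO_BITRATES = {"32k", "64k", "96k", "128k", "192k"}

# Original layout: two adjacent literals, concatenated implicitly by Python.
split_message = (
    "output_audio_bitrate must be one of "
    f"{', '.join(sorted(ALLOWED_OUTPUT_AUDIO_BITRATES))}"
)

# Suggested layout: a single f-string.
single_message = f"output_audio_bitrate must be one of {', '.join(sorted(ALLOWED_OUTPUT_AUDIO_BITRATES))}"

# Both spellings produce an identical message.
assert split_message == single_message
```

The suggestion therefore changes only how the source is laid out, not the text a caller sees.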

Copilot · 2026-02-26T15:56:19Z

livekit-plugins/livekit-plugins-sarvam/livekit/plugins/sarvam/tts.py

        pace: Speech rate multiplier (0.5 to 2.0)
        loudness: Volume multiplier (0.5 to 2.0)
+        temperature: Sampling temperature (0.01 to 1.0), used for v3 and v3-beta
+        output_audio_bitrate: Output audio bitrate


The documentation for output_audio_bitrate should include the allowed values for consistency with other parameters. Consider updating the documentation to: "output_audio_bitrate: Output audio bitrate (32k, 64k, 96k, 128k, or 192k)"

Suggested change

output_audio_bitrate: Output audio bitrate

output_audio_bitrate: Output audio bitrate (32k, 64k, 96k, 128k, or 192k)

Copilot · 2026-02-26T15:56:19Z

livekit-plugins/livekit-plugins-sarvam/livekit/plugins/sarvam/tts.py

+        min_buffer_size: Minimum character length for flushing
+        max_chunk_length: Maximum chunk length for sentence splitting


The documentation for min_buffer_size and max_chunk_length should include their valid ranges for consistency. Consider updating to: "min_buffer_size: Minimum character length for flushing (30 to 200)" and "max_chunk_length: Maximum chunk length for sentence splitting (50 to 500)"

Suggested change

min_buffer_size: Minimum character length for flushing

max_chunk_length: Maximum chunk length for sentence splitting

min_buffer_size: Minimum character length for flushing (30 to 200)

max_chunk_length: Maximum chunk length for sentence splitting (50 to 500)

initial

b45a03c

remove output_audio_codec

a877f0f

kuba-- requested a review from Copilot February 26, 2026 15:52

Copilot started reviewing on behalf of kuba-- February 26, 2026 15:52

Copilot AI reviewed Feb 26, 2026

theomonnom approved these changes Feb 26, 2026

theomonnom merged commit 69e66fa into livekit:main Feb 26, 2026
13 checks passed

theomonnom merged 2 commits into livekit:main from dhruvladia-sarvam:sarvam-v3-tts-addns
