@@ -69,7 +69,17 @@ Transcribe audio and video files to text using leading AI providers. Supports mu
6969
7070#### Output
7171
72- This tool does not produce any outputs.
72+ | Parameter | Type | Description |
73+ | --------- | ---- | ----------- |
74+ | ` transcript ` | string | Full transcribed text |
75+ | ` segments ` | array | Timestamped segments |
76+ | ↳ ` text ` | string | Transcribed text for this segment |
77+ | ↳ ` start ` | number | Start time in seconds |
78+ | ↳ ` end ` | number | End time in seconds |
79+ | ↳ ` speaker ` | string | Speaker identifier \( if diarization enabled\) |
80+ | ↳ ` confidence ` | number | Confidence score \( 0-1\) |
81+ | ` language ` | string | Detected or specified language |
82+ | ` duration ` | number | Audio duration in seconds |
7383
7484### ` stt_deepgram `
7585
@@ -89,7 +99,18 @@ This tool does not produce any outputs.
8999
90100#### Output
91101
92- This tool does not produce any outputs.
102+ | Parameter | Type | Description |
103+ | --------- | ---- | ----------- |
104+ | ` transcript ` | string | Full transcribed text |
105+ | ` segments ` | array | Timestamped segments with speaker labels |
106+ | ↳ ` text ` | string | Transcribed text for this segment |
107+ | ↳ ` start ` | number | Start time in seconds |
108+ | ↳ ` end ` | number | End time in seconds |
109+ | ↳ ` speaker ` | string | Speaker identifier \( if diarization enabled\) |
110+ | ↳ ` confidence ` | number | Confidence score \( 0-1\) |
111+ | ` language ` | string | Detected or specified language |
112+ | ` duration ` | number | Audio duration in seconds |
113+ | ` confidence ` | number | Overall confidence score |
93114
94115### ` stt_elevenlabs `
95116
@@ -108,7 +129,13 @@ This tool does not produce any outputs.
108129
109130#### Output
110131
111- This tool does not produce any outputs.
132+ | Parameter | Type | Description |
133+ | --------- | ---- | ----------- |
134+ | ` transcript ` | string | Full transcribed text |
135+ | ` segments ` | array | Timestamped segments |
136+ | ` language ` | string | Detected or specified language |
137+ | ` duration ` | number | Audio duration in seconds |
138+ | ` confidence ` | number | Overall confidence score |
112139
113140### ` stt_assemblyai `
114141
@@ -132,7 +159,30 @@ This tool does not produce any outputs.
132159
133160#### Output
134161
135- This tool does not produce any outputs.
162+ | Parameter | Type | Description |
163+ | --------- | ---- | ----------- |
164+ | ` transcript ` | string | Full transcribed text |
165+ | ` segments ` | array | Timestamped segments with speaker labels |
166+ | ↳ ` text ` | string | Transcribed text for this segment |
167+ | ↳ ` start ` | number | Start time in seconds |
168+ | ↳ ` end ` | number | End time in seconds |
169+ | ↳ ` speaker ` | string | Speaker identifier \( if diarization enabled\) |
170+ | ↳ ` confidence ` | number | Confidence score \( 0-1\) |
171+ | ` language ` | string | Detected or specified language |
172+ | ` duration ` | number | Audio duration in seconds |
173+ | ` confidence ` | number | Overall confidence score |
174+ | ` sentiment ` | array | Sentiment analysis results |
175+ | ↳ ` text ` | string | Text that was analyzed |
176+ | ↳ ` sentiment ` | string | Sentiment \( POSITIVE, NEGATIVE, NEUTRAL\) |
177+ | ↳ ` confidence ` | number | Confidence score |
178+ | ↳ ` start ` | number | Start time in milliseconds |
179+ | ↳ ` end ` | number | End time in milliseconds |
180+ | ` entities ` | array | Detected entities |
181+ | ↳ ` entity_type ` | string | Entity type \( e.g., person_name, location, organization\) |
182+ | ↳ ` text ` | string | Entity text |
183+ | ↳ ` start ` | number | Start time in milliseconds |
184+ | ↳ ` end ` | number | End time in milliseconds |
185+ | ` summary ` | string | Auto-generated summary |
136186
137187### ` stt_gemini `
138188
@@ -151,6 +201,12 @@ This tool does not produce any outputs.
151201
152202#### Output
153203
154- This tool does not produce any outputs.
204+ | Parameter | Type | Description |
205+ | --------- | ---- | ----------- |
206+ | ` transcript ` | string | Full transcribed text |
207+ | ` segments ` | array | Timestamped segments |
208+ | ` language ` | string | Detected or specified language |
209+ | ` duration ` | number | Audio duration in seconds |
210+ | ` confidence ` | number | Overall confidence score |
155211
156212
0 commit comments